Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente

Santos, Rafael Menêses

Bibliographic details
Year of defense:	2016
Main author:	Santos, Rafael Menêses
Advisor:	Matos, Leonardo Nogueira
Examination committee:	Not provided by the institution
Document type:	Master's thesis
Access type:	Open access
Language:	por
Defense institution:	Universidade Federal de Sergipe
Graduate program:	Pós-Graduação em Ciência da Computação
Department:	Not provided by the institution
Country:	Brazil
Keywords (Portuguese):	Computação; Redes neurais (Computação); Reconhecimento automático da voz; Processos de Markov; Convolucionais; HMM; Reconhecimento de fala
Keywords (English):	Speech recognition; Convolutional neural networks
CNPq knowledge area:	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Access link:	https://ri.ufs.br/handle/riufs/3363
Abstract:	One of the biggest challenges in speech recognition today is its use in everyday settings, where distortion and environmental noise are present and hinder the task. In the last thirty years, hundreds of methods for noise-robust recognition have been proposed, each with its own advantages and disadvantages. This thesis proposes the use of Convolutional Neural Networks (CNN) as acoustic models in automatic speech recognition (ASR) systems, as an alternative to the classical recognition methods based on Hidden Markov Models (HMM) without any noise-robust method applied. Experiments were performed with an audio set modified by additive and natural noises, and showed that the presented method reduces the Equal Error Rate (EER) and improves the accuracy of speech recognition in noisy environments when compared to traditional classification models, indicating the robustness of the approach.

Item metadata

id	UFS-2_3eda4c1391202494d5227dc82adb1a0d
oai_identifier_str	oai:ufs.br:riufs/3363
network_acronym_str	UFS-2
network_name_str	Repositório Institucional da UFS
repository_id_str
dc.title.por.fl_str_mv	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
title	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
spellingShingle	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente Santos, Rafael Menêses Computação Redes neurais (Computação) Reconhecimento automático da voz Processos de Markov Convolucionais HMM Reconhecimento de fala Speech recognition Convolutional neural networks CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
title_full	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
title_fullStr	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
title_full_unstemmed	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
title_sort	Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente
author	Santos, Rafael Menêses
author_facet	Santos, Rafael Menêses
author_role	author
dc.contributor.author.fl_str_mv	Santos, Rafael Menêses
dc.contributor.advisor1.fl_str_mv	Matos, Leonardo Nogueira
dc.contributor.advisor-co1.fl_str_mv	Macedo, Hendrik Teixeira
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/1745081418797273
contributor_str_mv	Matos, Leonardo Nogueira Macedo, Hendrik Teixeira
dc.subject.por.fl_str_mv	Computação; Redes neurais (Computação); Reconhecimento automático da voz; Processos de Markov; Convolucionais; HMM; Reconhecimento de fala
topic	Computação; Redes neurais (Computação); Reconhecimento automático da voz; Processos de Markov; Convolucionais; HMM; Reconhecimento de fala; Speech recognition; Convolutional neural networks; CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.eng.fl_str_mv	Speech recognition; Convolutional neural networks
dc.subject.cnpq.fl_str_mv	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	One of the biggest challenges in speech recognition today is its use in everyday settings, where distortion and environmental noise are present and hinder the task. In the last thirty years, hundreds of methods for noise-robust recognition have been proposed, each with its own advantages and disadvantages. This thesis proposes the use of Convolutional Neural Networks (CNN) as acoustic models in automatic speech recognition (ASR) systems, as an alternative to the classical recognition methods based on Hidden Markov Models (HMM) without any noise-robust method applied. Experiments were performed with an audio set modified by additive and natural noises, and showed that the presented method reduces the Equal Error Rate (EER) and improves the accuracy of speech recognition in noisy environments when compared to traditional classification models, indicating the robustness of the approach.
publishDate	2016
dc.date.issued.fl_str_mv	2016-05-30
dc.date.accessioned.fl_str_mv	2017-09-26T11:34:29Z
dc.date.available.fl_str_mv	2017-09-26T11:34:29Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	SANTOS, Rafael Menêses. Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente. 2016. 40 f. Dissertação (Pós-Graduação em Ciência da Computação) - Universidade Federal de Sergipe, São Cristóvão, SE, 2016.
dc.identifier.uri.fl_str_mv	https://ri.ufs.br/handle/riufs/3363
identifier_str_mv	SANTOS, Rafael Menêses. Uma abordagem híbrida CNN-HMM para reconhecimento de fala tolerante a ruídos de ambiente. 2016. 40 f. Dissertação (Pós-Graduação em Ciência da Computação) - Universidade Federal de Sergipe, São Cristóvão, SE, 2016.
url	https://ri.ufs.br/handle/riufs/3363
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal de Sergipe
dc.publisher.program.fl_str_mv	Pós-Graduação em Ciência da Computação
dc.publisher.initials.fl_str_mv	UFS
dc.publisher.country.fl_str_mv	Brasil
publisher.none.fl_str_mv	Universidade Federal de Sergipe
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFS instname:Universidade Federal de Sergipe (UFS) instacron:UFS
instname_str	Universidade Federal de Sergipe (UFS)
instacron_str	UFS
institution	UFS
reponame_str	Repositório Institucional da UFS
collection	Repositório Institucional da UFS
bitstream.url.fl_str_mv	https://ri.ufs.br/jspui/bitstream/riufs/3363/2/RAFAEL_MENESES_SANTOS.pdf.txt https://ri.ufs.br/jspui/bitstream/riufs/3363/3/RAFAEL_MENESES_SANTOS.pdf.jpg https://ri.ufs.br/jspui/bitstream/riufs/3363/1/RAFAEL_MENESES_SANTOS.pdf
bitstream.checksum.fl_str_mv	358fbbd2203a95e61e2d86961e2f0f96 640dfa15b79a101574fc59c18a9ce4f2 0f0b24f0e304c633783f5e0847924350
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFS - Universidade Federal de Sergipe (UFS)
repository.mail.fl_str_mv	repositorio@academico.ufs.br
_version_	1802111126211657728
