Deep learning approaches for soft biometrics classification in videos

Aquino, Nelson Marcelo Romero

Deep learning approaches for soft biometrics classification in videos

Detalhes bibliográficos
Ano de defesa:	2018
Autor(a) principal:	Aquino, Nelson Marcelo Romero
Orientador(a):	Não Informado pela instituição
Banca de defesa:	Não Informado pela instituição
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	eng
Instituição de defesa:	Universidade Tecnológica Federal do Paraná Curitiba Brasil Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial UTFPR
Programa de Pós-Graduação:	Não Informado pela instituição
Departamento:	Não Informado pela instituição
País:	Não Informado pela instituição
Palavras-chave em Português:	Biometria Computação Aprendizado do computador Processamento de imagens - Técnicas digitais Identificação biométrica Visão por computador Métodos de simulação Engenharia elétrica Biometry Computer science Machine learning Image processing - Digital techniques Biometric identification Computer vision Simulation methods Electric engineering CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO Engenharia Elétrica
Link de acesso:	http://repositorio.utfpr.edu.br/jspui/handle/1/3173
Resumo:	The number of surveillance cameras installed in public places has grown enormously during the past years due to the necessity to increase public security, allowing to obtain a large amount of images and videos in real time without much effort. Different types of problems can be solved by processing the data obtained by security cameras, such as the identification of individuals. Soft biometrics attributes can be useful to perform this task, since they provide information that can be used to differentiate one person from another without requiring their direct cooperation. However, this demands an exhaustive process of analysis to be carried by one or more human observers. Depending on the quantity of cameras, this could even become an impossible task for humans. Hence, computer vision methods could be a valid alternative to perform soft biometric classification in images or videos. Within this score, Deep Learning (DL) methods have risen recently, achieving state-of-the art performances for several computer vision tasks such as object recognition, object detection and image segmentation. This is possible due to their capability to learn both, features and classifier, at once, in order to solve a particular problem. Following this line, this work aims at empirically studying the suitability of DL methods for classifying soft biometrics in images or videos. We present three contributions regarding this subject in this dissertation. First, we perform a study on the effect of data augmentation on the performance of convolutional neural networks for soft biometrics classification. The second contribution is related to transferring information from one soft biometric dataset into another to perform classification. This process is achieved by training a model with data from a dataset in order to test it on data from another one. Finally, we evaluate the use of DL models to represent or learn temporal dependencies, so as to perform soft biometrics classification in videos. For this task, we propose a novel approach based on the use of bidirectional long short term memory networks. Results for the experiments regarding data augmentation show that large augmentation sizes do not induce overfitting and that balancing a dataset before performing on-line data augmentation leads to the necessity of smaller augmentation sizes in order to start improving the performance of the networks. As for transfer learning, results show that there could be a correlation between the complexity and the similarity of the datasets that are used for training and testing a model. Thus, if this technique is applied, the training set should preferably be very similar to the test data and should have a higher complexity. Although this is not definitive, since there could be exceptions depending on the soft biometric attribute to classify. Regarding video classification, in general, our approaches based on a recurrent network and a DL model that represents temporal dependencies through a low-pass filter yielded better results, in terms of overall accuracy and classification balance, than the baseline, based on classifying a video using only one of its frames.

Metadados do item

id	UTFPR-12_3f0dbeac6e6ade3479c84d502b19e91d
oai_identifier_str	oai:repositorio.utfpr.edu.br:1/3173
network_acronym_str	UTFPR-12
network_name_str	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
repository_id_str
spelling	Deep learning approaches for soft biometrics classification in videosAbordagens de aprendizado profundo para classificação de biometrias fracas em vídeoBiometriaComputaçãoAprendizado do computadorProcessamento de imagens - Técnicas digitaisIdentificação biométricaVisão por computadorMétodos de simulaçãoEngenharia elétricaBiometryComputer scienceMachine learningImage processing - Digital techniquesBiometric identificationComputer visionSimulation methodsElectric engineeringCNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAOEngenharia ElétricaThe number of surveillance cameras installed in public places has grown enormously during the past years due to the necessity to increase public security, allowing to obtain a large amount of images and videos in real time without much effort. Different types of problems can be solved by processing the data obtained by security cameras, such as the identification of individuals. Soft biometrics attributes can be useful to perform this task, since they provide information that can be used to differentiate one person from another without requiring their direct cooperation. However, this demands an exhaustive process of analysis to be carried by one or more human observers. Depending on the quantity of cameras, this could even become an impossible task for humans. Hence, computer vision methods could be a valid alternative to perform soft biometric classification in images or videos. Within this score, Deep Learning (DL) methods have risen recently, achieving state-of-the art performances for several computer vision tasks such as object recognition, object detection and image segmentation. This is possible due to their capability to learn both, features and classifier, at once, in order to solve a particular problem. Following this line, this work aims at empirically studying the suitability of DL methods for classifying soft biometrics in images or videos. We present three contributions regarding this subject in this dissertation. First, we perform a study on the effect of data augmentation on the performance of convolutional neural networks for soft biometrics classification. The second contribution is related to transferring information from one soft biometric dataset into another to perform classification. This process is achieved by training a model with data from a dataset in order to test it on data from another one. Finally, we evaluate the use of DL models to represent or learn temporal dependencies, so as to perform soft biometrics classification in videos. For this task, we propose a novel approach based on the use of bidirectional long short term memory networks. Results for the experiments regarding data augmentation show that large augmentation sizes do not induce overfitting and that balancing a dataset before performing on-line data augmentation leads to the necessity of smaller augmentation sizes in order to start improving the performance of the networks. As for transfer learning, results show that there could be a correlation between the complexity and the similarity of the datasets that are used for training and testing a model. Thus, if this technique is applied, the training set should preferably be very similar to the test data and should have a higher complexity. Although this is not definitive, since there could be exceptions depending on the soft biometric attribute to classify. Regarding video classification, in general, our approaches based on a recurrent network and a DL model that represents temporal dependencies through a low-pass filter yielded better results, in terms of overall accuracy and classification balance, than the baseline, based on classifying a video using only one of its frames.O número de câmeras de vigilância instaladas em locais públicos cresceu enormemente nos últimos anos devido à necessidade de aumentar a segurança pública, permitindo obter uma grande quantidade de imagens e vídeos em tempo real sem muito esforço. Diferentes tipos de problemas podem ser resolvidos através do processamento dos dados obtidos por estas câmeras, como a identificação de indivíduos. As biometrias fracas podem ser úteis para executar esta tarefa, uma vez que elas fornecem informações que podem ser usadas para diferenciar uma pessoa de outra sem exigir a cooperação direta delas. No entanto, isso exige uma tarefa exaustiva de análise a ser feita por observadores humanos. Dependendo da quantidade de câmeras, isso pode até se tornar uma tarefa impossível. Métodos de visão computacional podem ser uma alternativa válida para realizar classificação de biometrias fracas em imagens ou vídeos. Os métodos de Deep Learning (DL) têm alcançado desempenhos muito bons em tarefas de visão computacional, como reconhecimento e detecção de objetos, ou segmentação de imagens. Seguindo esta linha, este trabalho tem como objetivo estudar a adequação de métodos de DL para classificar biometrias fracas em imagens ou vídeos. Três contribuições são apresentadas sobre este tema nesta dissertação. Primeiro, realizou-se um estudo sobre o efeito do aumento de dados no desempenho de redes neurais convolucionais para classificação de biometrias fracas em imagens. A segunda contribuição está relacionada com a transferência de informação de um conjunto de imagens a outro. Este processo se baseia em treinar um modelo com dados de uma distribuição e testá-lo em dados de outra distribuição. Finalmente, foi avaliado o uso de modelos de DL para realizar a classificação em vídeos. Para este propósito, foi proposta uma nova abordagem baseada no uso de redes de memória bidirecionais de longo e curto prazo. Resultados para os experimentos de aumento de dados mostram que grandes aumentos não induzem ao sobre-ajuste e que balancear um conjunto de dados antes do treino requer menor aumento para que o desempenho do modelo melhore. Quanto à transferência de informação, os resultados mostram que pode haver uma correlação entre a complexidade e similaridade dos conjuntos de dados que são utilizados para treinar e testar um modelo. Assim, se esta técnica for aplicada, o conjunto de treinamento deve preferencialmente ser muito semelhante ao do teste e deve ser de maior complexidade. Embora isso não seja definitivo, já que pode haver exceções dependendo da biometria fraca a classificar. Em termos de classificação de vídeo, em geral, nossas abordagens baseadas em uma rede neural recorrente e um modelo DL que representa dependências temporais através de um filtro passa-baixas produziram melhores resultados, em termos de acurácia geral e balanço de classificação, que uma abordagem baseada em classificar um vídeo usando apenas um de seus quadros.Universidade Tecnológica Federal do ParanáCuritibaBrasilPrograma de Pós-Graduação em Engenharia Elétrica e Informática IndustrialUTFPRLopes, Heitor Silvériohttp://lattes.cnpq.br/4045818083957064Lopes, Heitor SilvérioLazzaretti, André EugênioPerlin, Hugo AlbertoAquino, Nelson Marcelo Romero2018-05-21T19:50:31Z2018-05-21T19:50:31Z2018-03-02info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfAQUINO, Nelson Marcelo Romero. Deep learning approaches for soft biometrics classification in videos. 2018. 88 f. Dissertação (Mestrado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2018.http://repositorio.utfpr.edu.br/jspui/handle/1/3173enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))instname:Universidade Tecnológica Federal do Paraná (UTFPR)instacron:UTFPR2018-12-10T18:23:31Zoai:repositorio.utfpr.edu.br:1/3173Repositório InstitucionalPUBhttp://repositorio.utfpr.edu.br:8080/oai/requestriut@utfpr.edu.br \|\| sibi@utfpr.edu.bropendoar:2018-12-10T18:23:31Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)false
dc.title.none.fl_str_mv	Deep learning approaches for soft biometrics classification in videos Abordagens de aprendizado profundo para classificação de biometrias fracas em vídeo
title	Deep learning approaches for soft biometrics classification in videos
spellingShingle	Deep learning approaches for soft biometrics classification in videos Aquino, Nelson Marcelo Romero Biometria Computação Aprendizado do computador Processamento de imagens - Técnicas digitais Identificação biométrica Visão por computador Métodos de simulação Engenharia elétrica Biometry Computer science Machine learning Image processing - Digital techniques Biometric identification Computer vision Simulation methods Electric engineering CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO Engenharia Elétrica
title_short	Deep learning approaches for soft biometrics classification in videos
title_full	Deep learning approaches for soft biometrics classification in videos
title_fullStr	Deep learning approaches for soft biometrics classification in videos
title_full_unstemmed	Deep learning approaches for soft biometrics classification in videos
title_sort	Deep learning approaches for soft biometrics classification in videos
author	Aquino, Nelson Marcelo Romero
author_facet	Aquino, Nelson Marcelo Romero
author_role	author
dc.contributor.none.fl_str_mv	Lopes, Heitor Silvério http://lattes.cnpq.br/4045818083957064 Lopes, Heitor Silvério Lazzaretti, André Eugênio Perlin, Hugo Alberto
dc.contributor.author.fl_str_mv	Aquino, Nelson Marcelo Romero
dc.subject.por.fl_str_mv	Biometria Computação Aprendizado do computador Processamento de imagens - Técnicas digitais Identificação biométrica Visão por computador Métodos de simulação Engenharia elétrica Biometry Computer science Machine learning Image processing - Digital techniques Biometric identification Computer vision Simulation methods Electric engineering CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO Engenharia Elétrica
topic	Biometria Computação Aprendizado do computador Processamento de imagens - Técnicas digitais Identificação biométrica Visão por computador Métodos de simulação Engenharia elétrica Biometry Computer science Machine learning Image processing - Digital techniques Biometric identification Computer vision Simulation methods Electric engineering CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO Engenharia Elétrica
description	The number of surveillance cameras installed in public places has grown enormously during the past years due to the necessity to increase public security, allowing to obtain a large amount of images and videos in real time without much effort. Different types of problems can be solved by processing the data obtained by security cameras, such as the identification of individuals. Soft biometrics attributes can be useful to perform this task, since they provide information that can be used to differentiate one person from another without requiring their direct cooperation. However, this demands an exhaustive process of analysis to be carried by one or more human observers. Depending on the quantity of cameras, this could even become an impossible task for humans. Hence, computer vision methods could be a valid alternative to perform soft biometric classification in images or videos. Within this score, Deep Learning (DL) methods have risen recently, achieving state-of-the art performances for several computer vision tasks such as object recognition, object detection and image segmentation. This is possible due to their capability to learn both, features and classifier, at once, in order to solve a particular problem. Following this line, this work aims at empirically studying the suitability of DL methods for classifying soft biometrics in images or videos. We present three contributions regarding this subject in this dissertation. First, we perform a study on the effect of data augmentation on the performance of convolutional neural networks for soft biometrics classification. The second contribution is related to transferring information from one soft biometric dataset into another to perform classification. This process is achieved by training a model with data from a dataset in order to test it on data from another one. Finally, we evaluate the use of DL models to represent or learn temporal dependencies, so as to perform soft biometrics classification in videos. For this task, we propose a novel approach based on the use of bidirectional long short term memory networks. Results for the experiments regarding data augmentation show that large augmentation sizes do not induce overfitting and that balancing a dataset before performing on-line data augmentation leads to the necessity of smaller augmentation sizes in order to start improving the performance of the networks. As for transfer learning, results show that there could be a correlation between the complexity and the similarity of the datasets that are used for training and testing a model. Thus, if this technique is applied, the training set should preferably be very similar to the test data and should have a higher complexity. Although this is not definitive, since there could be exceptions depending on the soft biometric attribute to classify. Regarding video classification, in general, our approaches based on a recurrent network and a DL model that represents temporal dependencies through a low-pass filter yielded better results, in terms of overall accuracy and classification balance, than the baseline, based on classifying a video using only one of its frames.
publishDate	2018
dc.date.none.fl_str_mv	2018-05-21T19:50:31Z 2018-05-21T19:50:31Z 2018-03-02
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	AQUINO, Nelson Marcelo Romero. Deep learning approaches for soft biometrics classification in videos. 2018. 88 f. Dissertação (Mestrado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2018. http://repositorio.utfpr.edu.br/jspui/handle/1/3173
identifier_str_mv	AQUINO, Nelson Marcelo Romero. Deep learning approaches for soft biometrics classification in videos. 2018. 88 f. Dissertação (Mestrado em Engenharia Elétrica e Informática Industrial) - Universidade Tecnológica Federal do Paraná, Curitiba, 2018.
url	http://repositorio.utfpr.edu.br/jspui/handle/1/3173
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Tecnológica Federal do Paraná Curitiba Brasil Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial UTFPR
publisher.none.fl_str_mv	Universidade Tecnológica Federal do Paraná Curitiba Brasil Programa de Pós-Graduação em Engenharia Elétrica e Informática Industrial UTFPR
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) instname:Universidade Tecnológica Federal do Paraná (UTFPR) instacron:UTFPR
instname_str	Universidade Tecnológica Federal do Paraná (UTFPR)
instacron_str	UTFPR
institution	UTFPR
reponame_str	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
collection	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
repository.name.fl_str_mv	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)
repository.mail.fl_str_mv	riut@utfpr.edu.br \|\| sibi@utfpr.edu.br
_version_	1850498304274595840

Deep learning approaches for soft biometrics classification in videos

Registros relacionados