Deep learning-based reconstruction of shredded documents

Paixão, Thiago Meireles

Deep learning-based reconstruction of shredded documents

Detalhes bibliográficos
Ano de defesa:	2022
Autor(a) principal:	Paixão, Thiago Meireles
Orientador(a):	Santos, Thiago Oliveira dos
Banca de defesa:	Boldt, Francisco de Assis , Hirata Junior, Roberto , Britto Junior, Alceu de Souza
Tipo de documento:	Tese
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Universidade Federal do Espírito Santo Doutorado em Ciência da Computação
Programa de Pós-Graduação:	Programa de Pós-Graduação em Informática
Departamento:	Centro Tecnológico
País:	BR
Palavras-chave em Português:	Reconstrução de documentos fragmentados Avaliação de compatibilidade Problema de quebra cabeça Otimização combinatorial
Área do conhecimento CNPq:	Ciência da Computação
Link de acesso:	http://repositorio.ufes.br/handle/10/15982
Resumo:	The reconstruction of shredded documents is a relevant task in various domains, such as forensic investigation and history reconstruction. As an alternative for the manual reconstruction, researchers have been investigating ways to perform (semi-)automatic digital reconstruction. Despite the several works on this topic, dealing with real-shredded data is a very sensitive issue in the current literature. Two research directions are addressed in this thesis to face this scenario: properly evaluating the fitting of shreds (the bulk of this work) and integrating the human into the reconstruction process. Regarding the fitting (compatibility) evaluation, it was verified that traditional pixel based approaches are not robust to real shredding, while more sophisticated techniques compromise significantly time performance. This thesis presents two deep learning self supervised approaches that have achieved state-of-the-art accuracy in more realistic/complex scenarios involving several real-shredded documents where the shreds are mixed (multi-page reconstruction or multi-reconstruction). The first approach models the compatibility evaluation as a two-class (valid or invalid) pattern recognition problem. The second approach, based on deep metric learning, proposes decoupling feature extraction from compatibility evaluation to improve scalability (time performance) for large reconstruction instances. Human interaction is explored to improve the accuracy of automatic methods. A critical issue regarding this topic is that the proposed methods do not scale well for large instances (real scenario), either because the user has the entire responsibility of arranging the shreds, or because he/she has to visualize the reconstruction and designate the shreds to be analyzed. In face of this challenge, we propose a human-in-the-loop framework that automatically selects potential mistakes (wrong pairings) in the solution for user analysis.

Metadados do item

id	UFES_68ecc9667ca35fff65af5a1efd120027
oai_identifier_str	oai:repositorio.ufes.br:10/15982
network_acronym_str	UFES
network_name_str	Repositório Institucional da Universidade Federal do Espírito Santo (riUfes)
repository_id_str
spelling	Santos, Thiago Oliveira doshttps://orcid.org/0000-0001-7607-635Xhttp://lattes.cnpq.br/5117339495064254Paixão, Thiago Meireleshttps://orcid.org/0000000315546834http://lattes.cnpq.br/2961730349897943Boldt, Francisco de Assishttps://orcid.org/0000-0001-6919-5377http://lattes.cnpq.br/0385991152092556Hirata Junior, Robertohttps://orcid.org/0000-0003-3861-7260http://lattes.cnpq.br/1647118503085126Britto Junior, Alceu de Souzahttp://lattes.cnpq.br/4251936710939364 2024-05-30T00:53:25Z2024-05-30T00:53:25Z2022-05-10The reconstruction of shredded documents is a relevant task in various domains, such as forensic investigation and history reconstruction. As an alternative for the manual reconstruction, researchers have been investigating ways to perform (semi-)automatic digital reconstruction. Despite the several works on this topic, dealing with real-shredded data is a very sensitive issue in the current literature. Two research directions are addressed in this thesis to face this scenario: properly evaluating the fitting of shreds (the bulk of this work) and integrating the human into the reconstruction process. Regarding the fitting (compatibility) evaluation, it was verified that traditional pixel based approaches are not robust to real shredding, while more sophisticated techniques compromise significantly time performance. This thesis presents two deep learning self supervised approaches that have achieved state-of-the-art accuracy in more realistic/complex scenarios involving several real-shredded documents where the shreds are mixed (multi-page reconstruction or multi-reconstruction). The first approach models the compatibility evaluation as a two-class (valid or invalid) pattern recognition problem. The second approach, based on deep metric learning, proposes decoupling feature extraction from compatibility evaluation to improve scalability (time performance) for large reconstruction instances. Human interaction is explored to improve the accuracy of automatic methods. A critical issue regarding this topic is that the proposed methods do not scale well for large instances (real scenario), either because the user has the entire responsibility of arranging the shreds, or because he/she has to visualize the reconstruction and designate the shreds to be analyzed. In face of this challenge, we propose a human-in-the-loop framework that automatically selects potential mistakes (wrong pairings) in the solution for user analysis.A reconstrução de documentos fragmentados é uma tarefa importante em diversas situações, tais como na investigação forense e na reconstrução de fatos históricos. Como alternativa ao processo manual, pesquisadores têm desenvolvido métodos para reconstruir documentos (semi-)automaticamente no domínio digital. Apesar dos diversos trabalhos na área, tratar adequadamente dados reais obtidos com uso de máquinas fragmentadoras é um problema crítico na literatura. Neste contexto, duas direções de pesquisa foram abordadas nesta tese: a avaliação robusta de compatibilidade entre os fragmentos, que é o foco do nosso trabalho, e a interação homem-máquina no processo de reconstrução. Com respeito à avaliação de compatibilidade, verificou-se que as técnicas tradicionais baseadas em análise de pixel não são robustas à fragmentação real, enquanto técnicas mais sofisticadas comprometem significativamente a eficiência (tempo de processamento). Esta tese propõe duas abordagens baseadas em deep learning para cenários mais complexos/realísticos envolvendo, além da fragmentação mecânica, a mistura de fragmentos provenientes de diversas páginas de documentos (multi-page reconstruction ou multireconstruction). A primeira abordagem modela a avaliação de compatibilidade como um problema de reconhecimento de padrões envolvendo duas classes (válida e inválida). A segunda abordagem, baseada no paradigma deep metric learning, propõe separar as etapas de extração de características e avaliação de compatibilidade para melhor eficiência na reconstrução de maiores instâncias de reconstrução. Ainteração humana é explorada num segundo momento para se obter maior acurácia comparada aos métodos automáticos. Em relação a este tema, um fator crítico é que os métodos propostos na literatura não escalam eficientemente com o aumento do número de fragmentos (cenário mais realístico). Isso se deve ao fato do usuário ser totalmente responsável pela organização dos fragmentos, e/ou porque ele precisa visualizar todo o documento reconstruído para designar fragmentos a serem analisados. Diante deste desafio, propusemos um framework que explora a interação homem-máquina e que automaticamente seleciona potenciais erros na solução (pareamentos incorretos) para serem analisados pelo usuário.Texthttp://repositorio.ufes.br/handle/10/15982porUniversidade Federal do Espírito SantoDoutorado em Ciência da ComputaçãoPrograma de Pós-Graduação em InformáticaUFESBRCentro Tecnológicosubject.br-rjbnCiência da ComputaçãoReconstrução de documentos fragmentadosAvaliação de compatibilidadeProblema de quebra cabeçaOtimização combinatorialDeep learning-based reconstruction of shredded documentsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da Universidade Federal do Espírito Santo (riUfes)instname:Universidade Federal do Espírito Santo (UFES)instacron:UFESORIGINALThiagoMeirelesPaixao-2022-tese.pdfapplication/pdf60622961http://repositorio.ufes.br/bitstreams/4f07f077-33d1-4e3f-b9cc-c4f8fbc3ca31/download28056b0e0081c23ea2ccee444a379ca1MD5110/159822025-08-13 13:49:47.374oai:repositorio.ufes.br:10/15982http://repositorio.ufes.brRepositório InstitucionalPUBhttp://repositorio.ufes.br/oai/requestriufes@ufes.bropendoar:21082025-08-13T13:49:47Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) - Universidade Federal do Espírito Santo (UFES)false
dc.title.none.fl_str_mv	Deep learning-based reconstruction of shredded documents
title	Deep learning-based reconstruction of shredded documents
spellingShingle	Deep learning-based reconstruction of shredded documents Paixão, Thiago Meireles Ciência da Computação Reconstrução de documentos fragmentados Avaliação de compatibilidade Problema de quebra cabeça Otimização combinatorial subject.br-rjbn
title_short	Deep learning-based reconstruction of shredded documents
title_full	Deep learning-based reconstruction of shredded documents
title_fullStr	Deep learning-based reconstruction of shredded documents
title_full_unstemmed	Deep learning-based reconstruction of shredded documents
title_sort	Deep learning-based reconstruction of shredded documents
author	Paixão, Thiago Meireles
author_facet	Paixão, Thiago Meireles
author_role	author
dc.contributor.authorID.none.fl_str_mv	https://orcid.org/0000000315546834
dc.contributor.authorLattes.none.fl_str_mv	http://lattes.cnpq.br/2961730349897943
dc.contributor.advisor1.fl_str_mv	Santos, Thiago Oliveira dos
dc.contributor.advisor1ID.fl_str_mv	https://orcid.org/0000-0001-7607-635X
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/5117339495064254
dc.contributor.author.fl_str_mv	Paixão, Thiago Meireles
dc.contributor.referee1.fl_str_mv	Boldt, Francisco de Assis
dc.contributor.referee1ID.fl_str_mv	https://orcid.org/0000-0001-6919-5377
dc.contributor.referee1Lattes.fl_str_mv	http://lattes.cnpq.br/0385991152092556
dc.contributor.referee2.fl_str_mv	Hirata Junior, Roberto
dc.contributor.referee2ID.fl_str_mv	https://orcid.org/0000-0003-3861-7260
dc.contributor.referee2Lattes.fl_str_mv	http://lattes.cnpq.br/1647118503085126
dc.contributor.referee3.fl_str_mv	Britto Junior, Alceu de Souza
dc.contributor.referee3Lattes.fl_str_mv	http://lattes.cnpq.br/4251936710939364
contributor_str_mv	Santos, Thiago Oliveira dos Boldt, Francisco de Assis Hirata Junior, Roberto Britto Junior, Alceu de Souza
dc.subject.cnpq.fl_str_mv	Ciência da Computação
topic	Ciência da Computação Reconstrução de documentos fragmentados Avaliação de compatibilidade Problema de quebra cabeça Otimização combinatorial subject.br-rjbn
dc.subject.por.fl_str_mv	Reconstrução de documentos fragmentados Avaliação de compatibilidade Problema de quebra cabeça Otimização combinatorial
dc.subject.br-rjbn.none.fl_str_mv	subject.br-rjbn
description	The reconstruction of shredded documents is a relevant task in various domains, such as forensic investigation and history reconstruction. As an alternative for the manual reconstruction, researchers have been investigating ways to perform (semi-)automatic digital reconstruction. Despite the several works on this topic, dealing with real-shredded data is a very sensitive issue in the current literature. Two research directions are addressed in this thesis to face this scenario: properly evaluating the fitting of shreds (the bulk of this work) and integrating the human into the reconstruction process. Regarding the fitting (compatibility) evaluation, it was verified that traditional pixel based approaches are not robust to real shredding, while more sophisticated techniques compromise significantly time performance. This thesis presents two deep learning self supervised approaches that have achieved state-of-the-art accuracy in more realistic/complex scenarios involving several real-shredded documents where the shreds are mixed (multi-page reconstruction or multi-reconstruction). The first approach models the compatibility evaluation as a two-class (valid or invalid) pattern recognition problem. The second approach, based on deep metric learning, proposes decoupling feature extraction from compatibility evaluation to improve scalability (time performance) for large reconstruction instances. Human interaction is explored to improve the accuracy of automatic methods. A critical issue regarding this topic is that the proposed methods do not scale well for large instances (real scenario), either because the user has the entire responsibility of arranging the shreds, or because he/she has to visualize the reconstruction and designate the shreds to be analyzed. In face of this challenge, we propose a human-in-the-loop framework that automatically selects potential mistakes (wrong pairings) in the solution for user analysis.
publishDate	2022
dc.date.issued.fl_str_mv	2022-05-10
dc.date.accessioned.fl_str_mv	2024-05-30T00:53:25Z
dc.date.available.fl_str_mv	2024-05-30T00:53:25Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
format	doctoralThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://repositorio.ufes.br/handle/10/15982
url	http://repositorio.ufes.br/handle/10/15982
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	Text
dc.publisher.none.fl_str_mv	Universidade Federal do Espírito Santo Doutorado em Ciência da Computação
dc.publisher.program.fl_str_mv	Programa de Pós-Graduação em Informática
dc.publisher.initials.fl_str_mv	UFES
dc.publisher.country.fl_str_mv	BR
dc.publisher.department.fl_str_mv	Centro Tecnológico
publisher.none.fl_str_mv	Universidade Federal do Espírito Santo Doutorado em Ciência da Computação
dc.source.none.fl_str_mv	reponame:Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) instname:Universidade Federal do Espírito Santo (UFES) instacron:UFES
instname_str	Universidade Federal do Espírito Santo (UFES)
instacron_str	UFES
institution	UFES
reponame_str	Repositório Institucional da Universidade Federal do Espírito Santo (riUfes)
collection	Repositório Institucional da Universidade Federal do Espírito Santo (riUfes)
bitstream.url.fl_str_mv	http://repositorio.ufes.br/bitstreams/4f07f077-33d1-4e3f-b9cc-c4f8fbc3ca31/download
bitstream.checksum.fl_str_mv	28056b0e0081c23ea2ccee444a379ca1
bitstream.checksumAlgorithm.fl_str_mv	MD5
repository.name.fl_str_mv	Repositório Institucional da Universidade Federal do Espírito Santo (riUfes) - Universidade Federal do Espírito Santo (UFES)
repository.mail.fl_str_mv	riufes@ufes.br
_version_	1856037471748358144

Deep learning-based reconstruction of shredded documents

Registros relacionados