Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens

Anderson Aparecido dos Santos

Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens

Detalhes bibliográficos
Ano de defesa:	2025
Autor(a) principal:	Anderson Aparecido dos Santos
Orientador(a):	Wesley Nunes Goncalves
Banca de defesa:	Não Informado pela instituição
Tipo de documento:	Tese
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Fundação Universidade Federal de Mato Grosso do Sul
Programa de Pós-Graduação:	Não Informado pela instituição
Departamento:	Não Informado pela instituição
País:	Brasil
Palavras-chave em Português:	Drones, Sensoriamento Remoto, Aprendizagem Profunda
Link de acesso:	https://repositorio.ufms.br/handle/123456789/12481
Resumo:	Detecting small objects in high-resolution images is a significant challenge in computer vision, especially in agricultural scenarios, where the size of insects tends to occupy only a few pixels relative to the entire image. Despite the advances brought by Convolutional Neural Networks (CNNs), the standard use of these architectures presents limitations, as the information associated with small objects tends to be lost due to downsampling and pooling operations. In this context, this thesis proposes strategies to improve the detection and segmentation of small objects. Initially, images are divided into smaller patches, and each patch is used individually to train and validate the models. For inference and prediction on the test set, the patches are overlapped, ensuring that in at least one of them, the object remains uncut. To address the issue of objects that end up being divided among different patches, techniques are presented to filter out-of-pattern predictions, those with low confidence scores, or with redundant and overlapping regions. To allow these techniques to also be applied to segmentation approaches, masks were generated from the original annotations, enabling the evaluation of both detection and segmentation models. Considering the limitations of the traditional Intersection over Union (IoU) metric for small objects, especially due to its sensitivity to minor spatial inaccuracies, this thesis also proposes an alternative metric based on the distance between the centers of the bounding boxes. The experimental results demonstrate that the proposed approaches contribute to the localization of small objects in high-resolution images, showing that both detection and segmentation techniques can be effective, as long as the data are properly processed before and after being input into the model.

Metadados do item

id	UFMS_a6d5d8da7a97e175a6c6ce998f650b5b
oai_identifier_str	oai:repositorio.ufms.br:123456789/12481
network_acronym_str	UFMS
network_name_str	Repositório Institucional da UFMS
repository_id_str
spelling	2025-08-27T18:03:06Z2025-08-27T18:03:06Z2025https://repositorio.ufms.br/handle/123456789/12481Detecting small objects in high-resolution images is a significant challenge in computer vision, especially in agricultural scenarios, where the size of insects tends to occupy only a few pixels relative to the entire image. Despite the advances brought by Convolutional Neural Networks (CNNs), the standard use of these architectures presents limitations, as the information associated with small objects tends to be lost due to downsampling and pooling operations. In this context, this thesis proposes strategies to improve the detection and segmentation of small objects. Initially, images are divided into smaller patches, and each patch is used individually to train and validate the models. For inference and prediction on the test set, the patches are overlapped, ensuring that in at least one of them, the object remains uncut. To address the issue of objects that end up being divided among different patches, techniques are presented to filter out-of-pattern predictions, those with low confidence scores, or with redundant and overlapping regions. To allow these techniques to also be applied to segmentation approaches, masks were generated from the original annotations, enabling the evaluation of both detection and segmentation models. Considering the limitations of the traditional Intersection over Union (IoU) metric for small objects, especially due to its sensitivity to minor spatial inaccuracies, this thesis also proposes an alternative metric based on the distance between the centers of the bounding boxes. The experimental results demonstrate that the proposed approaches contribute to the localization of small objects in high-resolution images, showing that both detection and segmentation techniques can be effective, as long as the data are properly processed before and after being input into the model.Detectar objetos pequenos em imagens de alta resolução é um desafio relevante na visão computacional, especialmente em cenários agrícolas, nos quais o tamanho dos insetos tende a ocupar poucos pixels em relação à imagem inteira. Apesar dos avanços proporcionados pelas Convolutional Neural Networks (CNNs), o uso padrão dessas arquiteturas apresenta limitações, pois as informações associadas a objetos pequenos tendem a se perder devido às operações de downsampling e pooling. Considerando esse contexto, esta tese propõe estratégias para aprimorar a detecção e segmentação de objetos pequenos. Inicialmente, as imagens são divididas em recortes menores, e cada recorte é utilizado individualmente para treinar e validar os modelos. Para a inferência e predição no conjunto de teste, os recortes são sobrepostos, garantindo que, em pelo menos um deles, o objeto não seja recortado. Assim, para contornar o problema de objetos que acabam sendo divididos entre diferentes recortes, são apresentadas técnicas para filtrar predições fora do padrão, com baixa pontuação de confiança ou com regiões redundantes e sobrepostas. Para permitir que as técnicas também fossem aplicadas em abordagens de segmentação, foram geradas máscaras a partir das anotações originais, possibilitando a avaliação tanto de modelos detectores quanto segmentadores. Considerando as limitações da métrica tradicional de Intersection over Union (IoU) para objetos pequenos, especialmente devido à sensibilidade a pequenas imprecisões espaciais, esta tese também propõe uma métrica alternativa baseada na distância entre os centros das caixas delimitadoras. Os resultados experimentais demonstram que as abordagens propostas contribuem para a localização de objetos pequenos em imagens de alta resolução, mostrando que tanto técnicas de detecção quanto de segmentação podem ser eficazes, desde que os dados sejam processados adequadamente antes e depois de passarem pelo modelo.Fundação Universidade Federal de Mato Grosso do SulUFMSBrasilDrones, Sensoriamento Remoto, Aprendizagem ProfundaDo Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagensinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisWesley Nunes GoncalvesAnderson Aparecido dos Santosinfo:eu-repo/semantics/openAccessporreponame:Repositório Institucional da UFMSinstname:Universidade Federal de Mato Grosso do Sul (UFMS)instacron:UFMSORIGINALtese_doutorado.pdftese_doutorado.pdfapplication/pdf44126266https://repositorio.ufms.br/bitstream/123456789/12481/-1/tese_doutorado.pdfabdfaa3937ad06e90ed803e04db5152dMD5-1123456789/124812025-08-27 14:03:14.588oai:repositorio.ufms.br:123456789/12481Repositório InstitucionalPUBhttps://repositorio.ufms.br/oai/requestri.prograd@ufms.bropendoar:21242025-08-27T18:03:14Repositório Institucional da UFMS - Universidade Federal de Mato Grosso do Sul (UFMS)false
dc.title.pt_BR.fl_str_mv	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
title	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
spellingShingle	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens Anderson Aparecido dos Santos Drones, Sensoriamento Remoto, Aprendizagem Profunda
title_short	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
title_full	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
title_fullStr	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
title_full_unstemmed	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
title_sort	Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens
author	Anderson Aparecido dos Santos
author_facet	Anderson Aparecido dos Santos
author_role	author
dc.contributor.advisor1.fl_str_mv	Wesley Nunes Goncalves
dc.contributor.author.fl_str_mv	Anderson Aparecido dos Santos
contributor_str_mv	Wesley Nunes Goncalves
dc.subject.por.fl_str_mv	Drones, Sensoriamento Remoto, Aprendizagem Profunda
topic	Drones, Sensoriamento Remoto, Aprendizagem Profunda
description	Detecting small objects in high-resolution images is a significant challenge in computer vision, especially in agricultural scenarios, where the size of insects tends to occupy only a few pixels relative to the entire image. Despite the advances brought by Convolutional Neural Networks (CNNs), the standard use of these architectures presents limitations, as the information associated with small objects tends to be lost due to downsampling and pooling operations. In this context, this thesis proposes strategies to improve the detection and segmentation of small objects. Initially, images are divided into smaller patches, and each patch is used individually to train and validate the models. For inference and prediction on the test set, the patches are overlapped, ensuring that in at least one of them, the object remains uncut. To address the issue of objects that end up being divided among different patches, techniques are presented to filter out-of-pattern predictions, those with low confidence scores, or with redundant and overlapping regions. To allow these techniques to also be applied to segmentation approaches, masks were generated from the original annotations, enabling the evaluation of both detection and segmentation models. Considering the limitations of the traditional Intersection over Union (IoU) metric for small objects, especially due to its sensitivity to minor spatial inaccuracies, this thesis also proposes an alternative metric based on the distance between the centers of the bounding boxes. The experimental results demonstrate that the proposed approaches contribute to the localization of small objects in high-resolution images, showing that both detection and segmentation techniques can be effective, as long as the data are properly processed before and after being input into the model.
publishDate	2025
dc.date.accessioned.fl_str_mv	2025-08-27T18:03:06Z
dc.date.available.fl_str_mv	2025-08-27T18:03:06Z
dc.date.issued.fl_str_mv	2025
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/doctoralThesis
format	doctoralThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://repositorio.ufms.br/handle/123456789/12481
url	https://repositorio.ufms.br/handle/123456789/12481
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Fundação Universidade Federal de Mato Grosso do Sul
dc.publisher.initials.fl_str_mv	UFMS
dc.publisher.country.fl_str_mv	Brasil
publisher.none.fl_str_mv	Fundação Universidade Federal de Mato Grosso do Sul
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFMS instname:Universidade Federal de Mato Grosso do Sul (UFMS) instacron:UFMS
instname_str	Universidade Federal de Mato Grosso do Sul (UFMS)
instacron_str	UFMS
institution	UFMS
reponame_str	Repositório Institucional da UFMS
collection	Repositório Institucional da UFMS
bitstream.url.fl_str_mv	https://repositorio.ufms.br/bitstream/123456789/12481/-1/tese_doutorado.pdf
bitstream.checksum.fl_str_mv	abdfaa3937ad06e90ed803e04db5152d
bitstream.checksumAlgorithm.fl_str_mv	MD5
repository.name.fl_str_mv	Repositório Institucional da UFMS - Universidade Federal de Mato Grosso do Sul (UFMS)
repository.mail.fl_str_mv	ri.prograd@ufms.br
_version_	1845881973898215424

Do Macro ao Micro: Abordagens em Reconhecimento e Segmentação de Objetos em Imagens

Registros relacionados