Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente

Shimura, Bruno Anthony

Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente

Detalhes bibliográficos
Ano de defesa:	2025
Autor(a) principal:	Shimura, Bruno Anthony
Orientador(a):	Bugatti, Pedro Henrique
Banca de defesa:	Não Informado pela instituição
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Universidade Federal de São Carlos Câmpus São Carlos
Programa de Pós-Graduação:	Programa de Pós-Graduação em Ciência da Computação - PPGCC
Departamento:	Não Informado pela instituição
País:	Não Informado pela instituição
Palavras-chave em Português:	Aprendizado autossupervisionado Aprendizado contrastivo Seleção de pares Aprendizado por distância Espaço latente Visão computacional
Palavras-chave em Inglês:	Self-supervised learning Contrastive learning Pair selection Distance-guided representation learning Latent space Computer vision
Área do conhecimento CNPq:	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
Link de acesso:	https://hdl.handle.net/20.500.14289/23434
Resumo:	Self-Supervised Learning (SSL) has emerged as a powerful paradigm in computer vision, enabling models to learn meaningful feature representations directly from unlabeled data. Among SSL approaches, contrastive learning has gained particular prominence for its ability to induce discriminative embeddings by pulling together positive pairs and pushing apart negatives. However, random sampling of such pairs often disregards the underlying geometric structure of the latent space, leading to suboptimal representation quality and inconsistent class separation. To address this limitation, this work introduces Distance-Guided Contrastive Learning (DGCL), a self-supervised approach that systematically selects informative sample pairs based on their geometric configuration in the latent manifold. For each anchor sample, DGCL identifies the farthest intra-class examples (hard positives) and the nearest inter-class examples (hard negatives) through t-Distributed Stochastic Neighbor Embedding (t-SNE) projections. By iteratively refining these relationships across training cycles, the method progressively enhances intra-class compactness and inter-class separability. Experiments conducted on the CIFAR-10, FER-13, KDEF, and RAF-DB datasets demonstrate substantial improvements over conventional models trained without contrastive learning. The results reveal that DGCL yields geometrically consistent latent representations, characterized by reduced intra-class variance and well-structured semantic clusters.

Metadados do item

id	SCAR_b4bef281210d397168b7079ef768fd49
oai_identifier_str	oai:repositorio.ufscar.br:20.500.14289/23434
network_acronym_str	SCAR
network_name_str	Repositório Institucional da UFSCAR
repository_id_str
spelling	Shimura, Bruno AnthonyBugatti, Pedro Henriquehttp://lattes.cnpq.br/2177467029991118http://lattes.cnpq.br/2616108326649244Bueno, RenatoOliveira, Claiton dehttp://lattes.cnpq.br/7189857417959804http://lattes.cnpq.br/88512892651098912026-01-20T18:11:39Z2025-10-31SHIMURA, Bruno Anthony. Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente. 2025. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2025. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/23434.https://hdl.handle.net/20.500.14289/23434Self-Supervised Learning (SSL) has emerged as a powerful paradigm in computer vision, enabling models to learn meaningful feature representations directly from unlabeled data. Among SSL approaches, contrastive learning has gained particular prominence for its ability to induce discriminative embeddings by pulling together positive pairs and pushing apart negatives. However, random sampling of such pairs often disregards the underlying geometric structure of the latent space, leading to suboptimal representation quality and inconsistent class separation. To address this limitation, this work introduces Distance-Guided Contrastive Learning (DGCL), a self-supervised approach that systematically selects informative sample pairs based on their geometric configuration in the latent manifold. For each anchor sample, DGCL identifies the farthest intra-class examples (hard positives) and the nearest inter-class examples (hard negatives) through t-Distributed Stochastic Neighbor Embedding (t-SNE) projections. By iteratively refining these relationships across training cycles, the method progressively enhances intra-class compactness and inter-class separability. Experiments conducted on the CIFAR-10, FER-13, KDEF, and RAF-DB datasets demonstrate substantial improvements over conventional models trained without contrastive learning. The results reveal that DGCL yields geometrically consistent latent representations, characterized by reduced intra-class variance and well-structured semantic clusters.O aprendizado autossupervisionado (Self-Supervised Learning — SSL) tem se consolidado como um paradigma promissor no campo da visão computacional, por permitir o aprendizado de representações visuais robustas sem a necessidade de grandes volumes de dados rotulados. Entre suas vertentes, o aprendizado contrastivo destaca-se por induzir a formação de representações discriminativas por meio da aproximação de pares positivos e distanciamento de pares negativos. Entretanto, a seleção aleatória desses pares limita a exploração eficiente da estrutura geométrica subjacente aos dados, resultando em representações com separabilidade subótima entre classes. Com o objetivo de mitigar essa limitação, este trabalho propõe o Aprendizado Contrastivo Guiado por Distância (Distance-Guided Contrastive Learning — DGCL), uma abordagem autossupervisionada que introduz um mecanismo de seleção de pares baseado na estrutura geométrica do espaço latente. A proposta consiste em identificar, para cada amostra âncora, os exemplos mais informativos, as amostras positivas mais distantes dentro da mesma classe (hard positives) e as amostras negativas mais próximas de classes diferentes (hard negatives), utilizando projeções obtidas via t-Distributed Stochastic Neighbor Embedding (t-SNE). Essa política de seleção orientada pela distância é aplicada iterativamente, refinando progressivamente o espaço de representações. Os experimentos realizados nos conjuntos de imagens CIFAR-10, FER-13, KDEF e RAF-DB demonstram ganhos expressivos de desempenho em relação a modelos sem aprendizado contrastivo. Observa-se, em especial, que o DGCL promove uma compactação intra-classe mais pronunciada e uma separação inter-classe mais consistente, resultando em representações latentes mais coerentes e discriminativas. As análises qualitativas baseadas em projeções t-SNE e as matrizes de confusão confirmam a capacidade do método em estruturar o espaço latente de maneira semanticamente significativa.Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)porUniversidade Federal de São CarlosCâmpus São CarlosPrograma de Pós-Graduação em Ciência da Computação - PPGCCUFSCarAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessAprendizado autossupervisionadoAprendizado contrastivoSeleção de paresAprendizado por distânciaEspaço latenteVisão computacionalSelf-supervised learningContrastive learningPair selectionDistance-guided representation learningLatent spaceComputer visionCIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO9. Indústria, Inovação e InfraestruturaAprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latenteContrastive self-supervised learning with distance-guided positive and negative pair selectioninfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisreponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINALdissertacao_bruno_anthony_shimura_2025.pdfdissertacao_bruno_anthony_shimura_2025.pdfapplication/pdf15910540https://repositorio.ufscar.br/bitstreams/d5743ac2-f5ef-4ce2-bb7f-d5c57eaf3674/download0a30a6519741b38ba83e7698ecd82cb3MD51trueAnonymousREADCC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8906https://repositorio.ufscar.br/bitstreams/04e680da-7687-4a6e-80d2-02626a5a5c61/downloadfba754f0467e45ac3862bc2533fb2736MD52falseAnonymousREADTEXTdissertacao_bruno_anthony_shimura_2025.pdf.txtdissertacao_bruno_anthony_shimura_2025.pdf.txtExtracted texttext/plain103206https://repositorio.ufscar.br/bitstreams/2341e15e-038f-40ee-a6ac-28762bd7127f/download361fa54b0c95f5da3a07dd7f107fb073MD53falseAnonymousREADTHUMBNAILdissertacao_bruno_anthony_shimura_2025.pdf.jpgdissertacao_bruno_anthony_shimura_2025.pdf.jpgGenerated Thumbnailimage/jpeg4231https://repositorio.ufscar.br/bitstreams/0b2bfd81-eccf-4781-82cc-a31ed620e765/download81d1383b2cce9dcf3914ce7ff9ea37a0MD54falseAnonymousREAD20.500.14289/234342026-01-21T03:09:30.569984Zhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/Attribution-NonCommercial-NoDerivs 3.0 Brazilopen.accessoai:repositorio.ufscar.br:20.500.14289/23434https://repositorio.ufscar.brRepositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestrepositorio.sibi@ufscar.bropendoar:43222026-01-21T03:09:30Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false
dc.title.por.fl_str_mv	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
dc.title.alternative.eng.fl_str_mv	Contrastive self-supervised learning with distance-guided positive and negative pair selection
title	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
spellingShingle	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente Shimura, Bruno Anthony Aprendizado autossupervisionado Aprendizado contrastivo Seleção de pares Aprendizado por distância Espaço latente Visão computacional Self-supervised learning Contrastive learning Pair selection Distance-guided representation learning Latent space Computer vision CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO 9. Indústria, Inovação e Infraestrutura
title_short	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
title_full	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
title_fullStr	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
title_full_unstemmed	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
title_sort	Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente
author	Shimura, Bruno Anthony
author_facet	Shimura, Bruno Anthony
author_role	author
dc.contributor.authorlattes.none.fl_str_mv	http://lattes.cnpq.br/2616108326649244
dc.contributor.referee.none.fl_str_mv	Bueno, Renato Oliveira, Claiton de
dc.contributor.refereeLattes.none.fl_str_mv	http://lattes.cnpq.br/7189857417959804 http://lattes.cnpq.br/8851289265109891
dc.contributor.author.fl_str_mv	Shimura, Bruno Anthony
dc.contributor.advisor1.fl_str_mv	Bugatti, Pedro Henrique
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/2177467029991118
contributor_str_mv	Bugatti, Pedro Henrique
dc.subject.por.fl_str_mv	Aprendizado autossupervisionado Aprendizado contrastivo Seleção de pares Aprendizado por distância Espaço latente Visão computacional
topic	Aprendizado autossupervisionado Aprendizado contrastivo Seleção de pares Aprendizado por distância Espaço latente Visão computacional Self-supervised learning Contrastive learning Pair selection Distance-guided representation learning Latent space Computer vision CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO 9. Indústria, Inovação e Infraestrutura
dc.subject.eng.fl_str_mv	Self-supervised learning Contrastive learning Pair selection Distance-guided representation learning Latent space Computer vision
dc.subject.cnpq.fl_str_mv	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
dc.subject.ods.none.fl_str_mv	9. Indústria, Inovação e Infraestrutura
description	Self-Supervised Learning (SSL) has emerged as a powerful paradigm in computer vision, enabling models to learn meaningful feature representations directly from unlabeled data. Among SSL approaches, contrastive learning has gained particular prominence for its ability to induce discriminative embeddings by pulling together positive pairs and pushing apart negatives. However, random sampling of such pairs often disregards the underlying geometric structure of the latent space, leading to suboptimal representation quality and inconsistent class separation. To address this limitation, this work introduces Distance-Guided Contrastive Learning (DGCL), a self-supervised approach that systematically selects informative sample pairs based on their geometric configuration in the latent manifold. For each anchor sample, DGCL identifies the farthest intra-class examples (hard positives) and the nearest inter-class examples (hard negatives) through t-Distributed Stochastic Neighbor Embedding (t-SNE) projections. By iteratively refining these relationships across training cycles, the method progressively enhances intra-class compactness and inter-class separability. Experiments conducted on the CIFAR-10, FER-13, KDEF, and RAF-DB datasets demonstrate substantial improvements over conventional models trained without contrastive learning. The results reveal that DGCL yields geometrically consistent latent representations, characterized by reduced intra-class variance and well-structured semantic clusters.
publishDate	2025
dc.date.issued.fl_str_mv	2025-10-31
dc.date.accessioned.fl_str_mv	2026-01-20T18:11:39Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	SHIMURA, Bruno Anthony. Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente. 2025. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2025. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/23434.
dc.identifier.uri.fl_str_mv	https://hdl.handle.net/20.500.14289/23434
identifier_str_mv	SHIMURA, Bruno Anthony. Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente. 2025. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2025. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/23434.
url	https://hdl.handle.net/20.500.14289/23434
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade Federal de São Carlos Câmpus São Carlos
dc.publisher.program.fl_str_mv	Programa de Pós-Graduação em Ciência da Computação - PPGCC
dc.publisher.initials.fl_str_mv	UFSCar
publisher.none.fl_str_mv	Universidade Federal de São Carlos Câmpus São Carlos
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR
instname_str	Universidade Federal de São Carlos (UFSCAR)
instacron_str	UFSCAR
institution	UFSCAR
reponame_str	Repositório Institucional da UFSCAR
collection	Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv	https://repositorio.ufscar.br/bitstreams/d5743ac2-f5ef-4ce2-bb7f-d5c57eaf3674/download https://repositorio.ufscar.br/bitstreams/04e680da-7687-4a6e-80d2-02626a5a5c61/download https://repositorio.ufscar.br/bitstreams/2341e15e-038f-40ee-a6ac-28762bd7127f/download https://repositorio.ufscar.br/bitstreams/0b2bfd81-eccf-4781-82cc-a31ed620e765/download
bitstream.checksum.fl_str_mv	0a30a6519741b38ba83e7698ecd82cb3 fba754f0467e45ac3862bc2533fb2736 361fa54b0c95f5da3a07dd7f107fb073 81d1383b2cce9dcf3914ce7ff9ea37a0
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv	repositorio.sibi@ufscar.br
_version_	1859391320294948864

Aprendizado autossupervisionado contrastivo orientado pela estrutura geométrica do espaço latente

Registros relacionados