Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano

Detalhes bibliográficos
Ano de defesa: 2025
Autor(a) principal: Thais Silva Tavares
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Minas Gerais
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://hdl.handle.net/1843/83006
Resumo: The evolution of the genome and the emergence of genetic diseases are driven by mutations, which arise through various mechanisms and shape genetic variability and species adaptation. Despite decades of research on variations in mutation rates and types across genomic regions, few studies have systematically integrated comparative analyses of their spatial distribution in the human genome. In this study, we conducted a large-scale assessment of the distribution and functional characterization of single nucleotide variants (SNVs) across five distinct categories: (i) neutral or nearly neutral polymorphisms, (ii) rare variants, (iii) cancer-associated somatic mutations, (iv) clinically relevant pathogenic germline variants, and (v) benign variants. Using data from public repositories such as COSMIC, ClinVar, and HGDP, we proposed three models to describe the mutational landscape of the human genome. Our findings reveal a significant overlap between different SNV classes, suggesting that certain genomic regions are inherently more prone to accumulating mutations, regardless of their germline or somatic origin. Notably, single nucleotide polymorphisms (SNPs) and recurrent cancer mutations frequently co-occur at the same genomic loci and share identical alleles more often than expected by chance. Five mutational signatures (SBS1, SBS5, SBS6, SBS54, and SBS87) were associated with these shared sites, while CpG islands and microsatellites accounted for only a minor fraction of the observed mutations. Furthermore, functional enrichment analyses identified significant associations between the overlapping regions and key genomic sites, such as 13q12.12 and 19p13.3, as well as pathways involved in cellular signaling and extracellular matrix interactions, including PI3K/AKT/mTOR and ECM-receptor interaction, both of which are crucial for tumor progression. Based on these findings, we propose the existence of critical DNA loci that function as natural mutational hotspots, recurrently affected in both germline and somatic lineages. The identification of these hotspots raises important questions regarding the underlying mechanisms driving their occurrence and their potential role in cancer predisposition.
id UFMG_5dc9b15d3366327bc4e396972c024463
oai_identifier_str oai:repositorio.ufmg.br:1843/83006
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling 2025-06-17T18:35:17Z2025-09-08T23:54:44Z2025-06-17T18:35:17Z2025-05-22https://hdl.handle.net/1843/83006The evolution of the genome and the emergence of genetic diseases are driven by mutations, which arise through various mechanisms and shape genetic variability and species adaptation. Despite decades of research on variations in mutation rates and types across genomic regions, few studies have systematically integrated comparative analyses of their spatial distribution in the human genome. In this study, we conducted a large-scale assessment of the distribution and functional characterization of single nucleotide variants (SNVs) across five distinct categories: (i) neutral or nearly neutral polymorphisms, (ii) rare variants, (iii) cancer-associated somatic mutations, (iv) clinically relevant pathogenic germline variants, and (v) benign variants. Using data from public repositories such as COSMIC, ClinVar, and HGDP, we proposed three models to describe the mutational landscape of the human genome. Our findings reveal a significant overlap between different SNV classes, suggesting that certain genomic regions are inherently more prone to accumulating mutations, regardless of their germline or somatic origin. Notably, single nucleotide polymorphisms (SNPs) and recurrent cancer mutations frequently co-occur at the same genomic loci and share identical alleles more often than expected by chance. Five mutational signatures (SBS1, SBS5, SBS6, SBS54, and SBS87) were associated with these shared sites, while CpG islands and microsatellites accounted for only a minor fraction of the observed mutations. Furthermore, functional enrichment analyses identified significant associations between the overlapping regions and key genomic sites, such as 13q12.12 and 19p13.3, as well as pathways involved in cellular signaling and extracellular matrix interactions, including PI3K/AKT/mTOR and ECM-receptor interaction, both of which are crucial for tumor progression. Based on these findings, we propose the existence of critical DNA loci that function as natural mutational hotspots, recurrently affected in both germline and somatic lineages. The identification of these hotspots raises important questions regarding the underlying mechanisms driving their occurrence and their potential role in cancer predisposition.CAPES - Coordenação de Aperfeiçoamento de Pessoal de Nível SuperiorporUniversidade Federal de Minas Geraishttp://creativecommons.org/licenses/by-nc-nd/3.0/pt/info:eu-repo/semantics/openAccessMutações humanasVariantes de nucleotídeo único (SNVs)Hotspots mutacionaisLoci compartilhados germinativo-somáticoBioinformáticaMutaçãoPolimorfismo de Nucleotídeo ÚnicoHotspot de DoençaCenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humanoMutational landscape of the human genome: co-occurrence of single nucleotide variants (SNVs) of different evolutionary origins and functional consequencesinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisThais Silva Tavaresreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGhttp://lattes.cnpq.br/1226456190294355Francisco Pereira Lobohttp://lattes.cnpq.br/9614758933055047Renan Pedra de SouzaMateus Henrique GouveiaTetsu SakamotoLuiz Eduardo Vieira Del BemGlória Regina FrancoA evolução do genoma e o surgimento de doenças genéticas são impulsionados por mutações, que ocorrem por diferentes mecanismos e impactam a variabilidade genética e a adaptação das espécies. Apesar de décadas de pesquisa sobre variações nas taxas e tipos de mutações em regiões genômicas, poucos estudos integraram análises comparativas de sua distribuição espacial no genoma humano. Neste estudo, realizamos uma avaliação em larga escala da distribuição e caracterização funcional de Variantes de Nucleotídeo Único (SNVs) pertencentes a cinco categorias: (i) polimorfismos neutros ou quase neutros; (ii) variantes raras; (iii) mutações somáticas associadas ao câncer; (iv) variantes germinativas patogênicas de relevância clínica; e (v) variantes benignas. Utilizamos dados dos bancos públicos COSMIC, ClinVar e HGDP para propor três modelos de distribuição mutacional no genoma humano. Nossos resultados revelaram uma sobreposição significativa entre diferentes classes de SNVs, sugerindo que certas regiões genômicas são mais propensas a acumular mutações, independentemente de sua origem germinativa ou somática. Notavelmente, Polimorfismos de Nucleotídeo Único (SNPs) e mutações recorrentes no câncer ocorrem frequentemente nos mesmos loci genômicos e compartilham os mesmos alelos com uma frequência superior ao esperado por acaso. Identificamos cinco assinaturas mutacionais associadas a esses sítios compartilhados (SBS1, SBS5, SBS6, SBS54 e SBS87), enquanto ilhas CpG e microssatélites explicam apenas uma pequena fração dessas mutações. Além disso, análises de enriquecimento funcional apontaram associações significativas entre as regiões sobrepostas e áreas genômicas-chave, como 13q12.12 e 19p13.3, bem como vias relacionadas à sinalização celular e interação da matriz extracelular, incluindo PI3K/AKT/mTOR e ECM-receptor interaction, ambas fundamentais para a progressão tumoral. Com base nesses achados, sugerimos a existência de loci críticos no DNA que atuam como hotspots mutacionais naturais, suscetíveis a mutações recorrentes em ambas as linhagens celulares. A identificação desses hotspots levanta questões sobre os mecanismos subjacentes à sua ocorrência e sua possível contribuição para a predisposição ao câncer.https://orcid.org/0009-0008-3042-3334BrasilICB - DEPARTAMENTO DE BIOQUÍMICA E IMUNOLOGIAPrograma de Pós-Graduação em BioinformaticaUFMGORIGINALThaisTavares_teseFinal.pdfapplication/pdf8341134https://repositorio.ufmg.br//bitstreams/8f768df2-90aa-426e-ba70-d1978e4215c8/download7bc41c9a82f78904900b60688830704bMD51trueAnonymousREADCC-LICENSElicense_rdfapplication/octet-stream811https://repositorio.ufmg.br//bitstreams/dd7a70b6-0200-4983-8d0b-c4eefafedf4c/downloadcfd6801dba008cb6adbd9838b81582abMD52falseAnonymousREADLICENSElicense.txttext/plain2118https://repositorio.ufmg.br//bitstreams/00c95339-63bd-4c7d-8406-5f6ef7c82c85/downloadcda590c95a0b51b4d15f60c9642ca272MD53falseAnonymousREAD1843/830062025-09-08 20:54:44.888http://creativecommons.org/licenses/by-nc-nd/3.0/pt/Acesso Abertoopen.accessoai:repositorio.ufmg.br:1843/83006https://repositorio.ufmg.br/Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2025-09-08T23:54:44Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)falseTElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTy1FWENMVVNJVkEgRE8gUkVQT1NJVMOTUklPIElOU1RJVFVDSU9OQUwgREEgVUZNRwoKQ29tIGEgYXByZXNlbnRhw6fDo28gZGVzdGEgbGljZW7Dp2EsIHZvY8OqIChvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvcikgY29uY2VkZSBhbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIChSSS1VRk1HKSBvIGRpcmVpdG8gbsOjbyBleGNsdXNpdm8gZSBpcnJldm9nw6F2ZWwgZGUgcmVwcm9kdXppciBlL291IGRpc3RyaWJ1aXIgYSBzdWEgcHVibGljYcOnw6NvIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyw7RuaWNvIGUgZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBkZWNsYXJhIHF1ZSBjb25oZWNlIGEgcG9sw610aWNhIGRlIGNvcHlyaWdodCBkYSBlZGl0b3JhIGRvIHNldSBkb2N1bWVudG8gZSBxdWUgY29uaGVjZSBlIGFjZWl0YSBhcyBEaXJldHJpemVzIGRvIFJJLVVGTUcuCgpWb2PDqiBjb25jb3JkYSBxdWUgbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSBwdWJsaWNhw6fDo28gcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGTUcgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSBwdWJsaWNhw6fDo28gcGFyYSBmaW5zIGRlIHNlZ3VyYW7Dp2EsIGJhY2stdXAgZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogZGVjbGFyYSBxdWUgYSBzdWEgcHVibGljYcOnw6NvIMOpIG9yaWdpbmFsIGUgcXVlIHZvY8OqIHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIG8gZGVww7NzaXRvIGRlIHN1YSBwdWJsaWNhw6fDo28gbsOjbywgcXVlIHNlamEgZGUgc2V1IGNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHB1YmxpY2HDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiBkZWNsYXJhIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgYW8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHB1YmxpY2HDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBQVUJMSUNBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PLCBWT0PDiiBERUNMQVJBIFFVRSBSRVNQRUlUT1UgVE9ET1MgRSBRVUFJU1FVRVIgRElSRUlUT1MgREUgUkVWSVPDg08gQ09NTyBUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCk8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lKHMpIG91IG8ocykgbm9tZXMocykgZG8ocykgZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSBwdWJsaWNhw6fDo28sIGUgbsOjbyBmYXLDoSBxdWFscXVlciBhbHRlcmHDp8OjbywgYWzDqW0gZGFxdWVsYXMgY29uY2VkaWRhcyBwb3IgZXN0YSBsaWNlbsOnYS4K
dc.title.none.fl_str_mv Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
dc.title.alternative.none.fl_str_mv Mutational landscape of the human genome: co-occurrence of single nucleotide variants (SNVs) of different evolutionary origins and functional consequences
title Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
spellingShingle Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
Thais Silva Tavares
Bioinformática
Mutação
Polimorfismo de Nucleotídeo Único
Hotspot de Doença
Mutações humanas
Variantes de nucleotídeo único (SNVs)
Hotspots mutacionais
Loci compartilhados germinativo-somático
title_short Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
title_full Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
title_fullStr Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
title_full_unstemmed Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
title_sort Cenário de coocorrência de variantes de nucleotídeo único (SNVs) de diferentes origens evolutivas e consequências funcionais no genoma humano
author Thais Silva Tavares
author_facet Thais Silva Tavares
author_role author
dc.contributor.author.fl_str_mv Thais Silva Tavares
dc.subject.por.fl_str_mv Bioinformática
Mutação
Polimorfismo de Nucleotídeo Único
Hotspot de Doença
topic Bioinformática
Mutação
Polimorfismo de Nucleotídeo Único
Hotspot de Doença
Mutações humanas
Variantes de nucleotídeo único (SNVs)
Hotspots mutacionais
Loci compartilhados germinativo-somático
dc.subject.other.none.fl_str_mv Mutações humanas
Variantes de nucleotídeo único (SNVs)
Hotspots mutacionais
Loci compartilhados germinativo-somático
description The evolution of the genome and the emergence of genetic diseases are driven by mutations, which arise through various mechanisms and shape genetic variability and species adaptation. Despite decades of research on variations in mutation rates and types across genomic regions, few studies have systematically integrated comparative analyses of their spatial distribution in the human genome. In this study, we conducted a large-scale assessment of the distribution and functional characterization of single nucleotide variants (SNVs) across five distinct categories: (i) neutral or nearly neutral polymorphisms, (ii) rare variants, (iii) cancer-associated somatic mutations, (iv) clinically relevant pathogenic germline variants, and (v) benign variants. Using data from public repositories such as COSMIC, ClinVar, and HGDP, we proposed three models to describe the mutational landscape of the human genome. Our findings reveal a significant overlap between different SNV classes, suggesting that certain genomic regions are inherently more prone to accumulating mutations, regardless of their germline or somatic origin. Notably, single nucleotide polymorphisms (SNPs) and recurrent cancer mutations frequently co-occur at the same genomic loci and share identical alleles more often than expected by chance. Five mutational signatures (SBS1, SBS5, SBS6, SBS54, and SBS87) were associated with these shared sites, while CpG islands and microsatellites accounted for only a minor fraction of the observed mutations. Furthermore, functional enrichment analyses identified significant associations between the overlapping regions and key genomic sites, such as 13q12.12 and 19p13.3, as well as pathways involved in cellular signaling and extracellular matrix interactions, including PI3K/AKT/mTOR and ECM-receptor interaction, both of which are crucial for tumor progression. Based on these findings, we propose the existence of critical DNA loci that function as natural mutational hotspots, recurrently affected in both germline and somatic lineages. The identification of these hotspots raises important questions regarding the underlying mechanisms driving their occurrence and their potential role in cancer predisposition.
publishDate 2025
dc.date.accessioned.fl_str_mv 2025-06-17T18:35:17Z
2025-09-08T23:54:44Z
dc.date.available.fl_str_mv 2025-06-17T18:35:17Z
dc.date.issued.fl_str_mv 2025-05-22
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1843/83006
url https://hdl.handle.net/1843/83006
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv http://creativecommons.org/licenses/by-nc-nd/3.0/pt/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc-nd/3.0/pt/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
bitstream.url.fl_str_mv https://repositorio.ufmg.br//bitstreams/8f768df2-90aa-426e-ba70-d1978e4215c8/download
https://repositorio.ufmg.br//bitstreams/dd7a70b6-0200-4983-8d0b-c4eefafedf4c/download
https://repositorio.ufmg.br//bitstreams/00c95339-63bd-4c7d-8406-5f6ef7c82c85/download
bitstream.checksum.fl_str_mv 7bc41c9a82f78904900b60688830704b
cfd6801dba008cb6adbd9838b81582ab
cda590c95a0b51b4d15f60c9642ca272
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv repositorio@ufmg.br
_version_ 1862106018018754560