Deep learning para classificação hierárquica de elementos transponíveis
| Ano de defesa: | 2018 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Dissertação |
| Tipo de acesso: | Acesso aberto |
| Idioma: | por |
| Instituição de defesa: |
Universidade Federal de São Carlos
Câmpus São Carlos |
| Programa de Pós-Graduação: |
Programa de Pós-Graduação em Ciência da Computação - PPGCC
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Palavras-chave em Inglês: | |
| Área do conhecimento CNPq: | |
| Link de acesso: | https://repositorio.ufscar.br/handle/20.500.14289/10532 |
Resumo: | Transposable Elements (TEs) are DNA sequences that can change its location within a cell's genome. They contribute directly to the genetic variety of species. Besides, their transposition mechanisms can affect the functionality of genes. The correct identification and classification of TEs play a central role in comprehension of genomes. Generally, identification and classification of TEs are performed using tools that employs homology, by comparing a sequence to many sequences from a labeled TE database. Since the literature proposes hierarchical taxonomies to classify TEs according to classes and subclasses, this project aims to develop new classification methods employing Machine Learning (AM) and Artificial Neural Networks (RNA) trained using Deep Learning (DP) concepts. Deep Neural Networks have extend the state-of-art of many field of study, including bioinformatics. As the first step, DNA sequences labelled with previously identified TEs will be collected and mapped according to hierarchies provided by the literature. Next, Deep Learning's neural networks Restricted Booltzman Machine, Auto-encoders, MultiLayer Perceptrons and their stacked version were tested. With these datasets, different classification methods are proposed and compared with literature's methods. As contributions, two new strategies were proposed, nLLCPN (non-Leaf Local Classifier per Parent Node) and LCPNB (Classifier per Parent Node and Branch). Both of then adapt LCPN (Local Classifier per Parent Node) in order to allow classifications in inner nodes. Additionally, the deep neural networks presented superior or competitive results in most of the cases. |
| id |
SCAR_478cb88acebf8967783604126c6ecfa7 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufscar.br:20.500.14289/10532 |
| network_acronym_str |
SCAR |
| network_name_str |
Repositório Institucional da UFSCAR |
| repository_id_str |
|
| spelling |
Nakano, Felipe KenjiCerri, Ricardohttp://lattes.cnpq.br/6266519868438512http://lattes.cnpq.br/6192556929748278adc543d5-0453-4f4b-8360-1e70510ca1ce2018-10-01T18:16:53Z2018-10-01T18:16:53Z2018-09-05NAKANO, Felipe Kenji. Deep learning para classificação hierárquica de elementos transponíveis. 2018. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2018. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/10532.https://repositorio.ufscar.br/handle/20.500.14289/10532Transposable Elements (TEs) are DNA sequences that can change its location within a cell's genome. They contribute directly to the genetic variety of species. Besides, their transposition mechanisms can affect the functionality of genes. The correct identification and classification of TEs play a central role in comprehension of genomes. Generally, identification and classification of TEs are performed using tools that employs homology, by comparing a sequence to many sequences from a labeled TE database. Since the literature proposes hierarchical taxonomies to classify TEs according to classes and subclasses, this project aims to develop new classification methods employing Machine Learning (AM) and Artificial Neural Networks (RNA) trained using Deep Learning (DP) concepts. Deep Neural Networks have extend the state-of-art of many field of study, including bioinformatics. As the first step, DNA sequences labelled with previously identified TEs will be collected and mapped according to hierarchies provided by the literature. Next, Deep Learning's neural networks Restricted Booltzman Machine, Auto-encoders, MultiLayer Perceptrons and their stacked version were tested. With these datasets, different classification methods are proposed and compared with literature's methods. As contributions, two new strategies were proposed, nLLCPN (non-Leaf Local Classifier per Parent Node) and LCPNB (Classifier per Parent Node and Branch). Both of then adapt LCPN (Local Classifier per Parent Node) in order to allow classifications in inner nodes. Additionally, the deep neural networks presented superior or competitive results in most of the cases.Elementos Transponíveis (TEs) são sequências de DNA que podem se mover de um local para outro dentro do genoma de uma célula. Eles contribuem para a diversidade genética das espécies, e seus mecanismos de transposição podem afetar a funcionalidade dos genes. A correta identificação e classificação de TEs é útil para a compreensão de seus efeitos nos genomas. Como existem propostas na literatura para organizar os TEs em uma taxonomia hierárquica, com superclasses e subclasses, este trabalho investigou métodos de Classificação Hierárquica (CH) de TEs utilizando Aprendizado de Máquina (AM), e Redes Neurais Artificiais (RNAs) do paradigma Deep Learning (DP). RNAs profundas têm-se mostrado promissoras em diversos campos de estudo, incluindo bioinformática. Inicialmente, sequências com TEs previamente identificados foram coletadas e estruturadas de acordo com taxonomias hierárquicas. Em seguida, as redes neurais Restricted Booltzman Machine, Denoising Auto-encoder, MultiLayer Perceptron e suas versões empilhadas foram testadas. De posse dos conjuntos de dados, diferentes métodos de classificação foram propostos, e comparados com métodos existentes na literatura. Como resultados dessa pesquisa, foram propostas duas novas estratégias, chamadas nLLCPN (Classificador Local por Nó Pai não Folha) e LCPNB (Classificador Local por Nó Pai e Ramo). Ambas adaptam a estratégia LCPN (Classificador Local por Nó Pai), de maneira que classificações em nós internos sejam permitidas. Adicionalmente, as redes profundas apresentam desempenho superior ou competitivo à arquiteturas rasas na maioria dos casos.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)FAPESP: 2016/12489-2FAPESP: 2017/19264-9porUniversidade Federal de São CarlosCâmpus São CarlosPrograma de Pós-Graduação em Ciência da Computação - PPGCCUFSCarAprendizado de máquinaElementos transponíveisClassificação hierárquicaTransposable elementsHierarchical classificationDeep learningCIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAODeep learning para classificação hierárquica de elementos transponíveisDeep learning for hierarchical classification of transposable elementsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisOnline600600c997f5ee-db84-40ed-8971-521dd105f2d1info:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINALDissertacaoFInal.pdfDissertacaoFInal.pdfDissertacao com a folha de aprovação após a folha de rostoapplication/pdf1235200https://repositorio.ufscar.br/bitstreams/2548d707-5409-422f-bdb3-fa2d40403b8a/download9ee28757ae8ca2582459c37fa3c5816aMD512trueAnonymousREADLICENSElicense.txtlicense.txttext/plain; charset=utf-81957https://repositorio.ufscar.br/bitstreams/80373eb8-e971-474d-a65f-d6ee6eb1986f/downloadae0398b6f8b235e40ad82cba6c50031dMD513falseAnonymousREADTEXTDissertacaoFInal.pdf.txtDissertacaoFInal.pdf.txtExtracted texttext/plain245185https://repositorio.ufscar.br/bitstreams/cc4c175c-5eff-409a-8354-f3b28c6bda09/download59674b1b677ef364b00bcde38072f1ccMD516falseAnonymousREADTHUMBNAILDissertacaoFInal.pdf.jpgDissertacaoFInal.pdf.jpgIM Thumbnailimage/jpeg7561https://repositorio.ufscar.br/bitstreams/35d2127b-1ecc-46ce-890b-96fdd33ee56f/download228441872f8f3b8a6f1963dbe79a376cMD517falseAnonymousREAD20.500.14289/105322025-02-05 17:59:15.103Acesso abertoopen.accessoai:repositorio.ufscar.br:20.500.14289/10532https://repositorio.ufscar.brRepositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestrepositorio.sibi@ufscar.bropendoar:43222025-02-05T20:59:15Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)falseTElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTy1FWENMVVNJVkEKCkNvbSBhIGFwcmVzZW50YcOnw6NvIGRlc3RhIGxpY2Vuw6dhLCB2b2PDqiAobyBhdXRvciAoZXMpIG91IG8gdGl0dWxhciBkb3MgZGlyZWl0b3MgZGUgYXV0b3IpIGNvbmNlZGUgw6AgVW5pdmVyc2lkYWRlCkZlZGVyYWwgZGUgU8OjbyBDYXJsb3MgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsICB0cmFkdXppciAoY29uZm9ybWUgZGVmaW5pZG8gYWJhaXhvKSwgZS9vdQpkaXN0cmlidWlyIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyAoaW5jbHVpbmRvIG8gcmVzdW1vKSBwb3IgdG9kbyBvIG11bmRvIG5vIGZvcm1hdG8gaW1wcmVzc28gZSBlbGV0csO0bmljbyBlCmVtIHF1YWxxdWVyIG1laW8sIGluY2x1aW5kbyBvcyBmb3JtYXRvcyDDoXVkaW8gb3UgdsOtZGVvLgoKVm9jw6ogY29uY29yZGEgcXVlIGEgVUZTQ2FyIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28KcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhIFVGU0NhciBwb2RlIG1hbnRlciBtYWlzIGRlIHVtYSBjw7NwaWEgYSBzdWEgdGVzZSBvdQpkaXNzZXJ0YcOnw6NvIHBhcmEgZmlucyBkZSBzZWd1cmFuw6dhLCBiYWNrLXVwIGUgcHJlc2VydmHDp8Ojby4KClZvY8OqIGRlY2xhcmEgcXVlIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyDDqSBvcmlnaW5hbCBlIHF1ZSB2b2PDqiB0ZW0gbyBwb2RlciBkZSBjb25jZWRlciBvcyBkaXJlaXRvcyBjb250aWRvcwpuZXN0YSBsaWNlbsOnYS4gVm9jw6ogdGFtYsOpbSBkZWNsYXJhIHF1ZSBvIGRlcMOzc2l0byBkYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIG7Do28sIHF1ZSBzZWphIGRlIHNldQpjb25oZWNpbWVudG8sIGluZnJpbmdlIGRpcmVpdG9zIGF1dG9yYWlzIGRlIG5pbmd1w6ltLgoKQ2FzbyBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gY29udGVuaGEgbWF0ZXJpYWwgcXVlIHZvY8OqIG7Do28gcG9zc3VpIGEgdGl0dWxhcmlkYWRlIGRvcyBkaXJlaXRvcyBhdXRvcmFpcywgdm9jw6oKZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc8OjbyBpcnJlc3RyaXRhIGRvIGRldGVudG9yIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBwYXJhIGNvbmNlZGVyIMOgIFVGU0NhcgpvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUKaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBURVNFIE9VIERJU1NFUlRBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UKQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PIFFVRSBOw4NPIFNFSkEgQSBVRlNDYXIsClZPQ8OKIERFQ0xBUkEgUVVFIFJFU1BFSVRPVSBUT0RPUyBFIFFVQUlTUVVFUiBESVJFSVRPUyBERSBSRVZJU8ODTyBDT01PClRBTULDiU0gQVMgREVNQUlTIE9CUklHQcOHw5VFUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKQSBVRlNDYXIgc2UgY29tcHJvbWV0ZSBhIGlkZW50aWZpY2FyIGNsYXJhbWVudGUgbyBzZXUgbm9tZSAocykgb3UgbyhzKSBub21lKHMpIGRvKHMpCmRldGVudG9yKGVzKSBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvLCBlIG7Do28gZmFyw6EgcXVhbHF1ZXIgYWx0ZXJhw6fDo28sIGFsw6ltIGRhcXVlbGFzCmNvbmNlZGlkYXMgcG9yIGVzdGEgbGljZW7Dp2EuCg== |
| dc.title.por.fl_str_mv |
Deep learning para classificação hierárquica de elementos transponíveis |
| dc.title.alternative.eng.fl_str_mv |
Deep learning for hierarchical classification of transposable elements |
| title |
Deep learning para classificação hierárquica de elementos transponíveis |
| spellingShingle |
Deep learning para classificação hierárquica de elementos transponíveis Nakano, Felipe Kenji Aprendizado de máquina Elementos transponíveis Classificação hierárquica Transposable elements Hierarchical classification Deep learning CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
| title_short |
Deep learning para classificação hierárquica de elementos transponíveis |
| title_full |
Deep learning para classificação hierárquica de elementos transponíveis |
| title_fullStr |
Deep learning para classificação hierárquica de elementos transponíveis |
| title_full_unstemmed |
Deep learning para classificação hierárquica de elementos transponíveis |
| title_sort |
Deep learning para classificação hierárquica de elementos transponíveis |
| author |
Nakano, Felipe Kenji |
| author_facet |
Nakano, Felipe Kenji |
| author_role |
author |
| dc.contributor.authorlattes.por.fl_str_mv |
http://lattes.cnpq.br/6192556929748278 |
| dc.contributor.author.fl_str_mv |
Nakano, Felipe Kenji |
| dc.contributor.advisor1.fl_str_mv |
Cerri, Ricardo |
| dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/6266519868438512 |
| dc.contributor.authorID.fl_str_mv |
adc543d5-0453-4f4b-8360-1e70510ca1ce |
| contributor_str_mv |
Cerri, Ricardo |
| dc.subject.por.fl_str_mv |
Aprendizado de máquina Elementos transponíveis Classificação hierárquica |
| topic |
Aprendizado de máquina Elementos transponíveis Classificação hierárquica Transposable elements Hierarchical classification Deep learning CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
| dc.subject.eng.fl_str_mv |
Transposable elements Hierarchical classification Deep learning |
| dc.subject.cnpq.fl_str_mv |
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
| description |
Transposable Elements (TEs) are DNA sequences that can change its location within a cell's genome. They contribute directly to the genetic variety of species. Besides, their transposition mechanisms can affect the functionality of genes. The correct identification and classification of TEs play a central role in comprehension of genomes. Generally, identification and classification of TEs are performed using tools that employs homology, by comparing a sequence to many sequences from a labeled TE database. Since the literature proposes hierarchical taxonomies to classify TEs according to classes and subclasses, this project aims to develop new classification methods employing Machine Learning (AM) and Artificial Neural Networks (RNA) trained using Deep Learning (DP) concepts. Deep Neural Networks have extend the state-of-art of many field of study, including bioinformatics. As the first step, DNA sequences labelled with previously identified TEs will be collected and mapped according to hierarchies provided by the literature. Next, Deep Learning's neural networks Restricted Booltzman Machine, Auto-encoders, MultiLayer Perceptrons and their stacked version were tested. With these datasets, different classification methods are proposed and compared with literature's methods. As contributions, two new strategies were proposed, nLLCPN (non-Leaf Local Classifier per Parent Node) and LCPNB (Classifier per Parent Node and Branch). Both of then adapt LCPN (Local Classifier per Parent Node) in order to allow classifications in inner nodes. Additionally, the deep neural networks presented superior or competitive results in most of the cases. |
| publishDate |
2018 |
| dc.date.accessioned.fl_str_mv |
2018-10-01T18:16:53Z |
| dc.date.available.fl_str_mv |
2018-10-01T18:16:53Z |
| dc.date.issued.fl_str_mv |
2018-09-05 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.citation.fl_str_mv |
NAKANO, Felipe Kenji. Deep learning para classificação hierárquica de elementos transponíveis. 2018. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2018. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/10532. |
| dc.identifier.uri.fl_str_mv |
https://repositorio.ufscar.br/handle/20.500.14289/10532 |
| identifier_str_mv |
NAKANO, Felipe Kenji. Deep learning para classificação hierárquica de elementos transponíveis. 2018. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2018. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/10532. |
| url |
https://repositorio.ufscar.br/handle/20.500.14289/10532 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.relation.confidence.fl_str_mv |
600 600 |
| dc.relation.authority.fl_str_mv |
c997f5ee-db84-40ed-8971-521dd105f2d1 |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos |
| dc.publisher.program.fl_str_mv |
Programa de Pós-Graduação em Ciência da Computação - PPGCC |
| dc.publisher.initials.fl_str_mv |
UFSCar |
| publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR |
| instname_str |
Universidade Federal de São Carlos (UFSCAR) |
| instacron_str |
UFSCAR |
| institution |
UFSCAR |
| reponame_str |
Repositório Institucional da UFSCAR |
| collection |
Repositório Institucional da UFSCAR |
| bitstream.url.fl_str_mv |
https://repositorio.ufscar.br/bitstreams/2548d707-5409-422f-bdb3-fa2d40403b8a/download https://repositorio.ufscar.br/bitstreams/80373eb8-e971-474d-a65f-d6ee6eb1986f/download https://repositorio.ufscar.br/bitstreams/cc4c175c-5eff-409a-8354-f3b28c6bda09/download https://repositorio.ufscar.br/bitstreams/35d2127b-1ecc-46ce-890b-96fdd33ee56f/download |
| bitstream.checksum.fl_str_mv |
9ee28757ae8ca2582459c37fa3c5816a ae0398b6f8b235e40ad82cba6c50031d 59674b1b677ef364b00bcde38072f1cc 228441872f8f3b8a6f1963dbe79a376c |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR) |
| repository.mail.fl_str_mv |
repositorio.sibi@ufscar.br |
| _version_ |
1851688943082274816 |