GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos

Bibliographic details
Year of defense: 2017
Main author: Oliveira, Matheus Brito de
Advisor: Queiróz, Artur Trancoso Lopo
Defense committee: Not provided by the institution
Document type: Master's thesis
Access type: Open access
Language: por
Defense institution: Universidade Estadual de Feira de Santana
Graduate program: Mestrado em Computação Aplicada
Department: DEPARTAMENTO DE CIÊNCIAS EXATAS
Country: Brazil
Keywords in Portuguese:
NGS
Pipeline
Montagem de genoma
Bactéria
Keywords in English:
Genome assembly
Bacterial
CNPq knowledge area:
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Access link: http://localhost:8080/tede/handle/tede/513
Abstract: Assembling a bacterial genome means reordering sequenced fragments so that the original genome can be represented. To get the best results from an assembly, several steps are required, for instance analysis of read quality, read preprocessing, and a repeat of the quality analysis to check that the preprocessing was effective. Genome assembly is a complex step that depends on the type of sequencing used: there are several types of sequencers, each with its own characteristics, such as fragment size and throughput. Handling these characteristics requires several computational tools to support all of the steps above. Because the range of available software is broad and varied, users must learn to work with this computational diversity and often have to master knowledge from outside biology, which leaves less time for exploring biological questions in depth. In this context, we developed a pipeline that performs automated fragment analysis, read preprocessing, genome assembly, and contig orientation, with assembly as its main objective; the pipeline is managed by a web application called GATOOL (Genome Assembly Tool). To evaluate the application, tests were carried out with samples of two prokaryotic organisms, Bacillus amyloliquefaciens and Serratia marcescens, and a further test was performed with seven SRA samples. Both organisms were sequenced on the Ion PGM™ platform. The assembly was performed with SPAdes and Velvet, both of which use the de Bruijn graph paradigm for genome assembly; after this stage, the resulting set of contigs was ordered with CONTIGuator, which orders contigs against a reference. We observed that the GATOOL interface allowed quick and easy execution of several steps and processes in genome assembly, including the automated assembly of two prokaryotic species, making these processes easier for any user to run.
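The abstract names the de Bruijn graph as the paradigm behind the assemblers. As a rough illustration only, the toy sketch below builds such a graph from three made-up reads and follows its single unambiguous path to recover a contig. It is not how SPAdes, Velvet, or GATOOL are implemented; the function names, the reads, and the choice of k = 4 are all assumptions made for this example, and real assemblers additionally handle sequencing errors, reverse complements, and repeats.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Build a toy de Bruijn graph: nodes are (k-1)-mers, each k-mer adds one edge."""
    graph = defaultdict(list)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Edge from the k-mer's prefix to its suffix.
            graph[kmer[:-1]].append(kmer[1:])
    return graph


def walk_unambiguous_path(graph, start):
    """Extend a contig from `start` while the current node has exactly one distinct successor."""
    contig = start
    node = start
    visited = {start}
    while True:
        successors = set(graph.get(node, []))
        if len(successors) != 1:
            break  # dead end or branch: stop extending
        node = successors.pop()
        if node in visited:
            break  # cycle: stop to avoid looping forever
        visited.add(node)
        contig += node[-1]
    return contig


# Hypothetical overlapping reads, invented for this illustration.
reads = ["ATGGCG", "GGCGTG", "CGTGCA"]
graph = build_de_bruijn(reads, k=4)
contig = walk_unambiguous_path(graph, "ATG")
print(contig)
```

With these three reads the walk passes through every node once, so the printed contig spans all of them. On real data, branches caused by repeats or errors would end the walk early, which is one reason assemblers produce many contigs that then need ordering against a reference.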
id UEFS_4fb9892e0342322fa4fa9ef63c1a663d
oai_identifier_str oai:tede2.uefs.br:8080:tede/513
network_acronym_str UEFS
network_name_str Biblioteca Digital de Teses e Dissertações da UEFS
repository_id_str
spelling Submitted by Ricardo Cedraz Duque Moliterno (ricardo.moliterno@uefs.br) on 2017-10-09T22:34:41Z. No. of bitstreams: 1. MATHUES BRITO DE OLIVEIRA Dissertaçãov.pdf: 5287293 bytes, checksum: 8d3e3b854b5799f16c0b61b6a5d33f1c (MD5). Made available in DSpace on 2017-10-09T22:34:41Z (GMT). Previous issue date: 2017-06-12.
THUMBNAIL: MATHUES BRITO DE OLIVEIRA Dissertaçãov.pdf.jpg, image/jpeg, 3853 bytes
TEXT: MATHUES BRITO DE OLIVEIRA Dissertaçãov.pdf.txt, text/plain, 182918 bytes
ORIGINAL: MATHUES BRITO DE OLIVEIRA Dissertaçãov.pdf, application/pdf, 5287293 bytes
LICENSE: license.txt, text/plain; charset=utf-8, 2089 bytes
dc.title.por.fl_str_mv GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
title GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos
author Oliveira, Matheus Brito de
author_facet Oliveira, Matheus Brito de
author_role author
dc.contributor.advisor1.fl_str_mv Queiróz, Artur Trancoso Lopo
dc.contributor.authorID.fl_str_mv 01493806580
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/0008785408235675
dc.contributor.author.fl_str_mv Oliveira, Matheus Brito de
contributor_str_mv Queiróz, Artur Trancoso Lopo
dc.subject.eng.fl_str_mv Genome assembly
Bacterial
topic Genome assembly
Bacterial
NGS
Pipeline
Montagem de genoma
Bactéria
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.por.fl_str_mv NGS
Pipeline
Montagem de genoma
Bactéria
dc.subject.cnpq.fl_str_mv CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
publishDate 2017
dc.date.accessioned.fl_str_mv 2017-10-09T22:34:41Z
dc.date.issued.fl_str_mv 2017-06-12
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv OLIVEIRA, Matheus Brito de. GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos. 2017. 95 f. Dissertação (Mestrado em Computação Aplicada) - Universidade Estadual de Feira de Santana, Feira de Santana, 2017.
dc.identifier.uri.fl_str_mv http://localhost:8080/tede/handle/tede/513
identifier_str_mv OLIVEIRA, Matheus Brito de. GATOOL - Genome Assembly Tool: uma ferramenta web para montagem de genomas bacterianos. 2017. 95 f. Dissertação (Mestrado em Computação Aplicada) - Universidade Estadual de Feira de Santana, Feira de Santana, 2017.
url http://localhost:8080/tede/handle/tede/513
dc.language.iso.fl_str_mv por
language por
dc.relation.program.fl_str_mv 303317282311144204
dc.relation.confidence.fl_str_mv 600
600
600
dc.relation.department.fl_str_mv -5486832816611506211
dc.relation.cnpq.fl_str_mv 3671711205811204509
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Estadual de Feira de Santana
dc.publisher.program.fl_str_mv Mestrado em Computação Aplicada
dc.publisher.initials.fl_str_mv UEFS
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv DEPARTAMENTO DE CIÊNCIAS EXATAS
publisher.none.fl_str_mv Universidade Estadual de Feira de Santana
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da UEFS
instname:Universidade Estadual de Feira de Santana (UEFS)
instacron:UEFS
instname_str Universidade Estadual de Feira de Santana (UEFS)
instacron_str UEFS
institution UEFS
reponame_str Biblioteca Digital de Teses e Dissertações da UEFS
collection Biblioteca Digital de Teses e Dissertações da UEFS
bitstream.url.fl_str_mv http://tede2.uefs.br:8080/bitstream/tede/513/4/MATHUES+BRITO+DE+OLIVEIRA+Disserta%C3%A7%C3%A3ov.pdf.jpg
http://tede2.uefs.br:8080/bitstream/tede/513/3/MATHUES+BRITO+DE+OLIVEIRA+Disserta%C3%A7%C3%A3ov.pdf.txt
http://tede2.uefs.br:8080/bitstream/tede/513/2/MATHUES+BRITO+DE+OLIVEIRA+Disserta%C3%A7%C3%A3ov.pdf
http://tede2.uefs.br:8080/bitstream/tede/513/1/license.txt
bitstream.checksum.fl_str_mv e3ecb98d5b97650ae6c263ba585c6c63
514827d8eb3123198f8e80efbd69b72e
8d3e3b854b5799f16c0b61b6a5d33f1c
7b5ba3d2445355f386edab96125d42b7
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da UEFS - Universidade Estadual de Feira de Santana (UEFS)
repository.mail.fl_str_mv bcuefs@uefs.br|| bcref@uefs.br||bcuefs@uefs.br
_version_ 1865375236066639872