On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops

Detalhes bibliográficos
Ano de defesa: 2022
Autor(a) principal: Machado, Ingrid Pinheiro
Orientador(a): Silva, Júlio César do Vale
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Não Informado pela instituição
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
GBS
Link de acesso: http://www.repositorio.ufc.br/handle/riufc/69221
Resumo: The breeding programs cross-pollinated crops develop thousands of lines that, when combined, generate single-crosses that need to be evaluated for their performance in different sites, making this step the most expensive in released new cultivars. The molecular markers have proved to be a powerful tool in improving economically essential crops to accelerate this process. Currently, there are several genotyping platforms capable of providing thousands of SNP (Single Nucleotide Polymorphism) for performing genomic studies. However, the adoption of modern genomic enhancement for crops that do not yet have a reference genome is limited. Genotyping by sequencing (GBS) has emerged as an alternative to make such technologies viable for orphan crops. Once with these data, it is possible to build a simulated genome to perform the SNP calling where the discovery of polymorphisms will be intrinsic to the population under study without using an external genome. The term “orphan” is derived from the condition of neglect and helplessness of these crops by the scientific community, despite having great food and nutritional potential. Therefore, our goals were to verify whether the source of SNP can influence the assessment of the population structure of parental lines; ascertain if the SNP source can affect the determination of heterotic groups and the prediction of single-crosses performance, and to test if using GBS and the mock genome efficiently performs the SNP calling in orphan crops, the ones that don’t have reference genome available. For this, maize was used as a model species, where 330 parental lines were genotyped by two standard genotyping platforms, SNP-array and GBS. GBS data were used for two purposes, to perform the SNP calling using the parental line B73 (GBS-B73) as a reference genome and to build a mock genome (GBS-Mock) to perform the SNP calling without needing an external genome, making three genotyping scenarios: SNP-array, GBS-B73, and GBS-Mock. These scenarios were used to conduct studies of population structure and genetic diversity among parental lines. After, we used phenotypic data of 751 single-crosses generated from the diallel of these parental lines. From there, genomic diallel analyses were performed to separate parental lines into heterotic groups and choose the best testers. Subsequently, an additive-dominant model was applied to predict the performance of single-crosses. The results showed that the GBS-Mock presented similar results to the standard population structure studies approach. The genotyping scenarios also did not differ in the division of heterotic groups and the definition of testers. In the genomic prediction study, GBS-Mock performed similarly to the SNP-array and GBS-B73. These results showed that a mock genome constructed from the population's intrinsic polymorphisms to perform the SNP calling is an excellent strategy to support plant breeders in studies of diversity, population structure, the definition of heterotic groups, choice of testers, and genomic prediction in species that still do not have a reference genome available. Because it is an alternative to the rapid advance of orphan crop breeding, in this context, genotyping via GBS associated with the mock genome is an effective alternative for performing genomic studies in orphan crops, especially those that do not have a reference genome.
id UFC-7_6fd16e6d07f9f2e4961918c450f479fa
oai_identifier_str oai:repositorio.ufc.br:riufc/69221
network_acronym_str UFC-7
network_name_str Repositório Institucional da Universidade Federal do Ceará (UFC)
repository_id_str
spelling Machado, Ingrid PinheiroFritsche Neto, RobertoSilva, Júlio César do Vale2022-11-10T15:00:47Z2022-11-10T15:00:47Z2022MACHADO, Ingrid Pinheiro. On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops. 2022. 65 f. Tese (Doutorado Agronomia/Fitotecnia) – Universidade Federal do Ceará, Fortaleza, 2022.http://www.repositorio.ufc.br/handle/riufc/69221The breeding programs cross-pollinated crops develop thousands of lines that, when combined, generate single-crosses that need to be evaluated for their performance in different sites, making this step the most expensive in released new cultivars. The molecular markers have proved to be a powerful tool in improving economically essential crops to accelerate this process. Currently, there are several genotyping platforms capable of providing thousands of SNP (Single Nucleotide Polymorphism) for performing genomic studies. However, the adoption of modern genomic enhancement for crops that do not yet have a reference genome is limited. Genotyping by sequencing (GBS) has emerged as an alternative to make such technologies viable for orphan crops. Once with these data, it is possible to build a simulated genome to perform the SNP calling where the discovery of polymorphisms will be intrinsic to the population under study without using an external genome. The term “orphan” is derived from the condition of neglect and helplessness of these crops by the scientific community, despite having great food and nutritional potential. Therefore, our goals were to verify whether the source of SNP can influence the assessment of the population structure of parental lines; ascertain if the SNP source can affect the determination of heterotic groups and the prediction of single-crosses performance, and to test if using GBS and the mock genome efficiently performs the SNP calling in orphan crops, the ones that don’t have reference genome available. For this, maize was used as a model species, where 330 parental lines were genotyped by two standard genotyping platforms, SNP-array and GBS. GBS data were used for two purposes, to perform the SNP calling using the parental line B73 (GBS-B73) as a reference genome and to build a mock genome (GBS-Mock) to perform the SNP calling without needing an external genome, making three genotyping scenarios: SNP-array, GBS-B73, and GBS-Mock. These scenarios were used to conduct studies of population structure and genetic diversity among parental lines. After, we used phenotypic data of 751 single-crosses generated from the diallel of these parental lines. From there, genomic diallel analyses were performed to separate parental lines into heterotic groups and choose the best testers. Subsequently, an additive-dominant model was applied to predict the performance of single-crosses. The results showed that the GBS-Mock presented similar results to the standard population structure studies approach. The genotyping scenarios also did not differ in the division of heterotic groups and the definition of testers. In the genomic prediction study, GBS-Mock performed similarly to the SNP-array and GBS-B73. These results showed that a mock genome constructed from the population's intrinsic polymorphisms to perform the SNP calling is an excellent strategy to support plant breeders in studies of diversity, population structure, the definition of heterotic groups, choice of testers, and genomic prediction in species that still do not have a reference genome available. Because it is an alternative to the rapid advance of orphan crop breeding, in this context, genotyping via GBS associated with the mock genome is an effective alternative for performing genomic studies in orphan crops, especially those that do not have a reference genome.The breeding programs cross-pollinated crops develop thousands of lines that, when combined, generate single-crosses that need to be evaluated for their performance in different sites, making this step the most expensive in released new cultivars. The molecular markers have proved to be a powerful tool in improving economically essential crops to accelerate this process. Currently, there are several genotyping platforms capable of providing thousands of SNP (Single Nucleotide Polymorphism) for performing genomic studies. However, the adoption of modern genomic enhancement for crops that do not yet have a reference genome is limited. Genotyping by sequencing (GBS) has emerged as an alternative to make such technologies viable for orphan crops. Once with these data, it is possible to build a simulated genome to perform the SNP calling where the discovery of polymorphisms will be intrinsic to the population under study without using an external genome. The term “orphan” is derived from the condition of neglect and helplessness of these crops by the scientific community, despite having great food and nutritional potential. Therefore, our goals were to verify whether the source of SNP can influence the assessment of the population structure of parental lines; ascertain if the SNP source can affect the determination of heterotic groups and the prediction of single-crosses performance, and to test if using GBS and the mock genome efficiently performs the SNP calling in orphan crops, the ones that don’t have reference genome available. For this, maize was used as a model species, where 330 parental lines were genotyped by two standard genotyping platforms, SNP-array and GBS. GBS data were used for two purposes, to perform the SNP calling using the parental line B73 (GBS-B73) as a reference genome and to build a mock genome (GBS-Mock) to perform the SNP calling without needing an external genome, making three genotyping scenarios: SNP-array, GBS-B73, and GBS-Mock. These scenarios were used to conduct studies of population structure and genetic diversity among parental lines. After, we used phenotypic data of 751 single-crosses generated from the diallel of these parental lines. From there, genomic diallel analyses were performed to separate parental lines into heterotic groups and choose the best testers. Subsequently, an additive-dominant model was applied to predict the performance of single-crosses. The results showed that the GBS-Mock presented similar results to the standard population structure studies approach. The genotyping scenarios also did not differ in the division of heterotic groups and the definition of testers. In the genomic prediction study, GBS-Mock performed similarly to the SNP-array and GBS-B73. These results showed that a mock genome constructed from the population's intrinsic polymorphisms to perform the SNP calling is an excellent strategy to support plant breeders in studies of diversity, population structure, the definition of heterotic groups, choice of testers, and genomic prediction in species that still do not have a reference genome available. Because it is an alternative to the rapid advance of orphan crop breeding, in this context, genotyping via GBS associated with the mock genome is an effective alternative for performing genomic studies in orphan crops, especially those that do not have a reference genome.GBSSNP-arrayFormation of heterotic groupsGenomic prediction of single-crossesMinor cropsUnderused cropsSimulated genomeOn the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan cropsOn the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan cropsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisporreponame:Repositório Institucional da Universidade Federal do Ceará (UFC)instname:Universidade Federal do Ceará (UFC)instacron:UFCinfo:eu-repo/semantics/openAccessLICENSElicense.txtlicense.txttext/plain; charset=utf-81748http://repositorio.ufc.br/bitstream/riufc/69221/4/license.txt8a4605be74aa9ea9d79846c1fba20a33MD54ORIGINAL2022_tese_ipmachado.pdf2022_tese_ipmachado.pdfapplication/pdf6519793http://repositorio.ufc.br/bitstream/riufc/69221/3/2022_tese_ipmachado.pdfd8e572299b083196e0a919d6f8ead395MD53riufc/692212022-11-10 12:01:12.792oai:repositorio.ufc.br:riufc/69221Tk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=Repositório InstitucionalPUBhttp://www.repositorio.ufc.br/ri-oai/requestbu@ufc.br || repositorio@ufc.bropendoar:2022-11-10T15:01:12Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)false
dc.title.pt_BR.fl_str_mv On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
dc.title.en.pt_BR.fl_str_mv On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
title On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
spellingShingle On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
Machado, Ingrid Pinheiro
GBS
SNP-array
Formation of heterotic groups
Genomic prediction of single-crosses
Minor crops
Underused crops
Simulated genome
title_short On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
title_full On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
title_fullStr On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
title_full_unstemmed On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
title_sort On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops
author Machado, Ingrid Pinheiro
author_facet Machado, Ingrid Pinheiro
author_role author
dc.contributor.co-advisor.none.fl_str_mv Fritsche Neto, Roberto
dc.contributor.author.fl_str_mv Machado, Ingrid Pinheiro
dc.contributor.advisor1.fl_str_mv Silva, Júlio César do Vale
contributor_str_mv Silva, Júlio César do Vale
dc.subject.por.fl_str_mv GBS
SNP-array
Formation of heterotic groups
Genomic prediction of single-crosses
Minor crops
Underused crops
Simulated genome
topic GBS
SNP-array
Formation of heterotic groups
Genomic prediction of single-crosses
Minor crops
Underused crops
Simulated genome
description The breeding programs cross-pollinated crops develop thousands of lines that, when combined, generate single-crosses that need to be evaluated for their performance in different sites, making this step the most expensive in released new cultivars. The molecular markers have proved to be a powerful tool in improving economically essential crops to accelerate this process. Currently, there are several genotyping platforms capable of providing thousands of SNP (Single Nucleotide Polymorphism) for performing genomic studies. However, the adoption of modern genomic enhancement for crops that do not yet have a reference genome is limited. Genotyping by sequencing (GBS) has emerged as an alternative to make such technologies viable for orphan crops. Once with these data, it is possible to build a simulated genome to perform the SNP calling where the discovery of polymorphisms will be intrinsic to the population under study without using an external genome. The term “orphan” is derived from the condition of neglect and helplessness of these crops by the scientific community, despite having great food and nutritional potential. Therefore, our goals were to verify whether the source of SNP can influence the assessment of the population structure of parental lines; ascertain if the SNP source can affect the determination of heterotic groups and the prediction of single-crosses performance, and to test if using GBS and the mock genome efficiently performs the SNP calling in orphan crops, the ones that don’t have reference genome available. For this, maize was used as a model species, where 330 parental lines were genotyped by two standard genotyping platforms, SNP-array and GBS. GBS data were used for two purposes, to perform the SNP calling using the parental line B73 (GBS-B73) as a reference genome and to build a mock genome (GBS-Mock) to perform the SNP calling without needing an external genome, making three genotyping scenarios: SNP-array, GBS-B73, and GBS-Mock. These scenarios were used to conduct studies of population structure and genetic diversity among parental lines. After, we used phenotypic data of 751 single-crosses generated from the diallel of these parental lines. From there, genomic diallel analyses were performed to separate parental lines into heterotic groups and choose the best testers. Subsequently, an additive-dominant model was applied to predict the performance of single-crosses. The results showed that the GBS-Mock presented similar results to the standard population structure studies approach. The genotyping scenarios also did not differ in the division of heterotic groups and the definition of testers. In the genomic prediction study, GBS-Mock performed similarly to the SNP-array and GBS-B73. These results showed that a mock genome constructed from the population's intrinsic polymorphisms to perform the SNP calling is an excellent strategy to support plant breeders in studies of diversity, population structure, the definition of heterotic groups, choice of testers, and genomic prediction in species that still do not have a reference genome available. Because it is an alternative to the rapid advance of orphan crop breeding, in this context, genotyping via GBS associated with the mock genome is an effective alternative for performing genomic studies in orphan crops, especially those that do not have a reference genome.
publishDate 2022
dc.date.accessioned.fl_str_mv 2022-11-10T15:00:47Z
dc.date.available.fl_str_mv 2022-11-10T15:00:47Z
dc.date.issued.fl_str_mv 2022
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv MACHADO, Ingrid Pinheiro. On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops. 2022. 65 f. Tese (Doutorado Agronomia/Fitotecnia) – Universidade Federal do Ceará, Fortaleza, 2022.
dc.identifier.uri.fl_str_mv http://www.repositorio.ufc.br/handle/riufc/69221
identifier_str_mv MACHADO, Ingrid Pinheiro. On the usefulness of mock genomes to define heterotic pools, testers, and hybrid predictions in orphan crops. 2022. 65 f. Tese (Doutorado Agronomia/Fitotecnia) – Universidade Federal do Ceará, Fortaleza, 2022.
url http://www.repositorio.ufc.br/handle/riufc/69221
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da Universidade Federal do Ceará (UFC)
instname:Universidade Federal do Ceará (UFC)
instacron:UFC
instname_str Universidade Federal do Ceará (UFC)
instacron_str UFC
institution UFC
reponame_str Repositório Institucional da Universidade Federal do Ceará (UFC)
collection Repositório Institucional da Universidade Federal do Ceará (UFC)
bitstream.url.fl_str_mv http://repositorio.ufc.br/bitstream/riufc/69221/4/license.txt
http://repositorio.ufc.br/bitstream/riufc/69221/3/2022_tese_ipmachado.pdf
bitstream.checksum.fl_str_mv 8a4605be74aa9ea9d79846c1fba20a33
d8e572299b083196e0a919d6f8ead395
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)
repository.mail.fl_str_mv bu@ufc.br || repositorio@ufc.br
_version_ 1847793307015446528