Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.

Detalhes bibliográficos
Ano de defesa: 2023
Autor(a) principal: Gabriel Corrêa Veríssimo
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Minas Gerais
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://hdl.handle.net/1843/76020
Resumo: The emergence of multidrug-resistant bacteria is a severe health problem worldwide. The World Health Organization (WHO) categorizes carbapenem-resistant strains of Escherichia coli and methicillin/vancomycin-resistant strains of Staphylococcus aureus as critical and high-level priorities for antibacterial drug development. Computational methods, as molecular docking, are accelerating the development of new antibacterials; however, predictability of molecular docking can be improved with machine learning (ML). In this context, the aim with this work is to integrate ML models with molecular docking studies to enhance predictive ability of ligand inhibitory activity against both S. aureus and E. coli FabI enzymes. Additionally, the aim is also to obtain recombinant FabI enzymes from both species for in vitro assays using compounds selected through virtual screening., and to obtain the S. aureus e E. coli FabI recombinant enzymes for in vitro assays with selected compounds based on ML-based virtual screening. Therefore, 2,352 docking protocols were validated using redocking, crossdocking, and ROC curve analyses. Eleven ML algorithms were deployed, generating 220,856,328 classification models correlating inhibitory activity with interaction fingerprints, calculated based on docking poses. We selected the best models for each enzyme using several classification metrics. The top three models for each protein were then utilized in a virtual screening to select substances that were subsequently tested against both E. coli and S. aureus cells. To obtain the recombinant proteins, we transformed Escherichia coli BL21(DE3) cells via electroporation with pET28a-saFabI and pET29a-ecFabI plasmids, each containing the coding sequences for the proteins of interest. These cells were cultured at 37 ºC in Luria-Bertani medium supplemented with MgSO4 and kanamycin. IPTG was used to induce expression for a duration of 18 hours at 18 ºC. Cell lysis was carried out by sonication, and protein purification was performed using affinity, desalting, and size exclusion chromatography. Both proteins were obtained in the form of highly pure and stable tetramers. With regard to the machine learning results, the top three models of each enzyme yielded MCCint values ranging from 0.567 to 0.846, and MCCext values ranging from 0.638 to 1.000. The Support Vector Machine (SVM) and Multilayer Perceptron (MLP) algorithms produced the best results, outperforming docking in all metrics These models were used in a virtual screening, allowing nine active compounds to be obtained in in vitro assays against E. coli and S. aureus cells. Subsequent STD-NMR studies will be conducted to verify the interaction of these substances with the obtained proteins.
id UFMG_c79097d0006fc6ef01f6ca8c4cf1a7cd
oai_identifier_str oai:repositorio.ufmg.br:1843/76020
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.Integration of machine learning algorithms with molecular docking to design inhibitors for in vitro testing against the enoyl-ACP-reductase NAD(P)H-dependent enzyme (FabI) from Staphylococcus aureus and Escherichia coli.Aprendizado de máquinaDocking molecularPlanejamento de antibacterianosExpressão e purificação de enzimasEnoil-ACP-redutase (FabI)The emergence of multidrug-resistant bacteria is a severe health problem worldwide. The World Health Organization (WHO) categorizes carbapenem-resistant strains of Escherichia coli and methicillin/vancomycin-resistant strains of Staphylococcus aureus as critical and high-level priorities for antibacterial drug development. Computational methods, as molecular docking, are accelerating the development of new antibacterials; however, predictability of molecular docking can be improved with machine learning (ML). In this context, the aim with this work is to integrate ML models with molecular docking studies to enhance predictive ability of ligand inhibitory activity against both S. aureus and E. coli FabI enzymes. Additionally, the aim is also to obtain recombinant FabI enzymes from both species for in vitro assays using compounds selected through virtual screening., and to obtain the S. aureus e E. coli FabI recombinant enzymes for in vitro assays with selected compounds based on ML-based virtual screening. Therefore, 2,352 docking protocols were validated using redocking, crossdocking, and ROC curve analyses. Eleven ML algorithms were deployed, generating 220,856,328 classification models correlating inhibitory activity with interaction fingerprints, calculated based on docking poses. We selected the best models for each enzyme using several classification metrics. The top three models for each protein were then utilized in a virtual screening to select substances that were subsequently tested against both E. coli and S. aureus cells. To obtain the recombinant proteins, we transformed Escherichia coli BL21(DE3) cells via electroporation with pET28a-saFabI and pET29a-ecFabI plasmids, each containing the coding sequences for the proteins of interest. These cells were cultured at 37 ºC in Luria-Bertani medium supplemented with MgSO4 and kanamycin. IPTG was used to induce expression for a duration of 18 hours at 18 ºC. Cell lysis was carried out by sonication, and protein purification was performed using affinity, desalting, and size exclusion chromatography. Both proteins were obtained in the form of highly pure and stable tetramers. With regard to the machine learning results, the top three models of each enzyme yielded MCCint values ranging from 0.567 to 0.846, and MCCext values ranging from 0.638 to 1.000. The Support Vector Machine (SVM) and Multilayer Perceptron (MLP) algorithms produced the best results, outperforming docking in all metrics These models were used in a virtual screening, allowing nine active compounds to be obtained in in vitro assays against E. coli and S. aureus cells. Subsequent STD-NMR studies will be conducted to verify the interaction of these substances with the obtained proteins.CAPES - Coordenação de Aperfeiçoamento de Pessoal de Nível SuperiorUniversidade Federal de Minas Gerais2024-09-09T16:06:11Z2025-09-08T23:24:52Z2024-09-09T16:06:11Z2023-10-31info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/1843/76020porhttp://creativecommons.org/licenses/by-nc-nd/3.0/pt/info:eu-repo/semantics/openAccessGabriel Corrêa Veríssimoreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMG2025-09-08T23:24:52Zoai:repositorio.ufmg.br:1843/76020Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2025-09-08T23:24:52Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
Integration of machine learning algorithms with molecular docking to design inhibitors for in vitro testing against the enoyl-ACP-reductase NAD(P)H-dependent enzyme (FabI) from Staphylococcus aureus and Escherichia coli.
title Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
spellingShingle Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
Gabriel Corrêa Veríssimo
Aprendizado de máquina
Docking molecular
Planejamento de antibacterianos
Expressão e purificação de enzimas
Enoil-ACP-redutase (FabI)
title_short Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
title_full Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
title_fullStr Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
title_full_unstemmed Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
title_sort Integração de algoritmos de aprendizado de máquina ao docking molecular para planejamento e realização de ensaios in vitro de inibidores da enzima enoil-ACP-redutase NAD(P)H-dependente (FabI) de Staphylococcus aureus e de Escherichia coli.
author Gabriel Corrêa Veríssimo
author_facet Gabriel Corrêa Veríssimo
author_role author
dc.contributor.author.fl_str_mv Gabriel Corrêa Veríssimo
dc.subject.por.fl_str_mv Aprendizado de máquina
Docking molecular
Planejamento de antibacterianos
Expressão e purificação de enzimas
Enoil-ACP-redutase (FabI)
topic Aprendizado de máquina
Docking molecular
Planejamento de antibacterianos
Expressão e purificação de enzimas
Enoil-ACP-redutase (FabI)
description The emergence of multidrug-resistant bacteria is a severe health problem worldwide. The World Health Organization (WHO) categorizes carbapenem-resistant strains of Escherichia coli and methicillin/vancomycin-resistant strains of Staphylococcus aureus as critical and high-level priorities for antibacterial drug development. Computational methods, as molecular docking, are accelerating the development of new antibacterials; however, predictability of molecular docking can be improved with machine learning (ML). In this context, the aim with this work is to integrate ML models with molecular docking studies to enhance predictive ability of ligand inhibitory activity against both S. aureus and E. coli FabI enzymes. Additionally, the aim is also to obtain recombinant FabI enzymes from both species for in vitro assays using compounds selected through virtual screening., and to obtain the S. aureus e E. coli FabI recombinant enzymes for in vitro assays with selected compounds based on ML-based virtual screening. Therefore, 2,352 docking protocols were validated using redocking, crossdocking, and ROC curve analyses. Eleven ML algorithms were deployed, generating 220,856,328 classification models correlating inhibitory activity with interaction fingerprints, calculated based on docking poses. We selected the best models for each enzyme using several classification metrics. The top three models for each protein were then utilized in a virtual screening to select substances that were subsequently tested against both E. coli and S. aureus cells. To obtain the recombinant proteins, we transformed Escherichia coli BL21(DE3) cells via electroporation with pET28a-saFabI and pET29a-ecFabI plasmids, each containing the coding sequences for the proteins of interest. These cells were cultured at 37 ºC in Luria-Bertani medium supplemented with MgSO4 and kanamycin. IPTG was used to induce expression for a duration of 18 hours at 18 ºC. Cell lysis was carried out by sonication, and protein purification was performed using affinity, desalting, and size exclusion chromatography. Both proteins were obtained in the form of highly pure and stable tetramers. With regard to the machine learning results, the top three models of each enzyme yielded MCCint values ranging from 0.567 to 0.846, and MCCext values ranging from 0.638 to 1.000. The Support Vector Machine (SVM) and Multilayer Perceptron (MLP) algorithms produced the best results, outperforming docking in all metrics These models were used in a virtual screening, allowing nine active compounds to be obtained in in vitro assays against E. coli and S. aureus cells. Subsequent STD-NMR studies will be conducted to verify the interaction of these substances with the obtained proteins.
publishDate 2023
dc.date.none.fl_str_mv 2023-10-31
2024-09-09T16:06:11Z
2024-09-09T16:06:11Z
2025-09-08T23:24:52Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1843/76020
url https://hdl.handle.net/1843/76020
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv http://creativecommons.org/licenses/by-nc-nd/3.0/pt/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc-nd/3.0/pt/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv repositorio@ufmg.br
_version_ 1856414035826704384