Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante

Rêgo, Thais Gaudencio do

Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante

Detalhes bibliográficos
Ano de defesa:	2008
Autor(a) principal:	Rêgo, Thais Gaudencio do
Orientador(a):	Dardenne, Laurent Emmanuel
Banca de defesa:	Raupp, Fernanda Maria Pereira , Sant'anna, Carlos Maurício Rabello de
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Laboratório Nacional de Computação Científica
Programa de Pós-Graduação:	Programa de Pós-Graduação em Modelagem Computacional
Departamento:	Serviço de Análise e Apoio a Formação de Recursos Humanos
País:	BR
Palavras-chave em Português:	Biomoléculas - Estrutura - Simulação por computador proteínas - Estrutura - Simulação por computador Atracamento molecular Desenvolvimento de fármacos Energia livre
Área do conhecimento CNPq:	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Link de acesso:	https://tede.lncc.br/handle/tede/101
Resumo:	The understanding of receptor-ligand molecular recognition is one of the central aspects in structure-based design and discovery of new drugs. The key methodology is the docking of small molecules in active sites of proteins. There are two aims in any program of molecular docking: the search for the best ligand-protein conformation and the calculation of the free energy of this association, or its affinity constant. The test set used in this work was composed by 50 protein- ligand complexes, with experimentally measured Ki or Kd values for the construction of an empirical function specific to the DOCKTHOR program, using as input variables: energy of electrostatic interaction and Lennard-Jones, contact area of ligand-receptor on the surface accessible to the solvent, the presence of hydrogen bridges, and the number of the ligand rotatable bonds that were frozen in the process of docking. These variables were used for the construction of two types of free energy scoring functions. The importance of each variable used as input data for the construction of those functions was rated by means of multiple regression. A neural network was x also used to try to build the best model for the calculation of the affinity constant. The DOCKTHOR program currently has a prediction power leading to r = 0.4245, which indicates the importance of improving its scoring. The function built with the multiple regression methodology used 5 input variables and had linear, quadratic, and cross-product terms leading to r = 0.7542. Using the methodology of group cross-validation (VCG), it was concluded that the best architecture for the neural network consists of 9 neurons in the hidden layer, as it has the smallest error of generalization and greater consistency in errors. In the tests with this neural network architecture built using the same 50 protein-ligand complexes in training and test, 66% of the complexes had a difference smaller than 1.0 in the observed values. The generalization error (obtained by VCG) of a neural network that uses 9 neurons in the hidden layer was about ten times lower than that obtained by using a polynomial function. This is an indication of the superiority of the neural network methodology with respect to the multivariate regression methodology, specially for an empirical function developed for a broad range of receptor-ligand complexes.

Metadados do item

id	LNCC_a01236d5e27f330b5cac8d2b28d8d12a
oai_identifier_str	oai:tede-server.lncc.br:tede/101
network_acronym_str	LNCC
network_name_str	Biblioteca Digital de Teses e Dissertações do LNCC
repository_id_str
spelling	Dardenne, Laurent EmmanuelCPF:49809431104http://lattes.cnpq.br/8344194525615133Barbosa, Helio José CorrêaCPF:194 306 716 34http://lattes.cnpq.br/0375745110240885Raupp, Fernanda Maria PereiraCPF:00000000111http://lattes.cnpq.br/6932171005996406Sant'anna, Carlos Maurício Rabello deCPF:82723222772SANT\'ANNA, Carlos Maurício Rabello deCPF:012117554http://lattes.cnpq.br/3166390632199101Rêgo, Thais Gaudencio do2015-03-04T18:51:07Z2009-06-182008-08-25RÊGO, Thais Gaudencio do. Construction of Empirical Scoring Functions Using Artificial Neural Network for Determination of Affinites Constants Between Receptor-Ligand. 2008. 162 f. Dissertação (Mestrado em Modelagem computacional) - Laboratório Nacional de Computação Científica, Petrópolis, 2008.https://tede.lncc.br/handle/tede/101The understanding of receptor-ligand molecular recognition is one of the central aspects in structure-based design and discovery of new drugs. The key methodology is the docking of small molecules in active sites of proteins. There are two aims in any program of molecular docking: the search for the best ligand-protein conformation and the calculation of the free energy of this association, or its affinity constant. The test set used in this work was composed by 50 protein- ligand complexes, with experimentally measured Ki or Kd values for the construction of an empirical function specific to the DOCKTHOR program, using as input variables: energy of electrostatic interaction and Lennard-Jones, contact area of ligand-receptor on the surface accessible to the solvent, the presence of hydrogen bridges, and the number of the ligand rotatable bonds that were frozen in the process of docking. These variables were used for the construction of two types of free energy scoring functions. The importance of each variable used as input data for the construction of those functions was rated by means of multiple regression. A neural network was x also used to try to build the best model for the calculation of the affinity constant. The DOCKTHOR program currently has a prediction power leading to r = 0.4245, which indicates the importance of improving its scoring. The function built with the multiple regression methodology used 5 input variables and had linear, quadratic, and cross-product terms leading to r = 0.7542. Using the methodology of group cross-validation (VCG), it was concluded that the best architecture for the neural network consists of 9 neurons in the hidden layer, as it has the smallest error of generalization and greater consistency in errors. In the tests with this neural network architecture built using the same 50 protein-ligand complexes in training and test, 66% of the complexes had a difference smaller than 1.0 in the observed values. The generalization error (obtained by VCG) of a neural network that uses 9 neurons in the hidden layer was about ten times lower than that obtained by using a polynomial function. This is an indication of the superiority of the neural network methodology with respect to the multivariate regression methodology, specially for an empirical function developed for a broad range of receptor-ligand complexes.A compreensão dos mecanismos de reconhecimento molecular receptor-ligante é um dos aspectos centrais na descoberta e planejamento de novos fármacos baseado em estrutura. Uma metodologia chave é o atracamento de pequenas moléculas em sítios de ligação de proteínas, o atracamento molecular (em inglês, "molecular docking"). Existem dois pontos chaves em qualquer programa de atracamento: a busca da "melhor" conformação ligante-proteína e o cálculo da energia livre desta associação, ou sua constante de afinidade. Foi construído neste trabalho um conjunto-teste formado por 50 complexos proteína-ligante, com valores de K i ou K d determinados experimentalmente, para a construção de uma função empírica específica para o programa DOCKTHOR, utilizando como variáveis de entrada valores de energias de interação eletrostática e de Lennard-Jones, área de contato ligante-receptor da superfície acessível ao solvente, presença de ligações hidrogênio, e o número de ligações torcionáveis do ligante. Estes variáveis foram utilizados para a construção de dois tipos de funções de cálculo de energia livre. Através de regressão múltipla, foi avaliada a importância de cada uma das variáveis utilizadas como dados de entrada na construção desta função. Utilizando uma rede neural, buscou-se construir o melhor modelo para o cálculo de constantes de afinidade. O programa DOCKTHOR atualmente tem poder de predição correspondente a r = viii 0,4245, o que mostra a importância de se melhorar sua função de avaliação. A função construída com a metodologia de regressão múltipla que obteve melhor resultado foi a que utilizou as 5 variáveis de entrada apresentando termos lineares, cruzados e quadráticos, com r igual a 0,7542. Funções empíricas construídas por redes neurais também foram avaliadas neste trabalho. Utilizando a metodologia de validação cruzada de grupo (VCG) chegou-se à conclusão que a melhor arquitetura para a rede neural é constituída por 9 neurônios na camada oculta, pois possui o menor erro de generalização e a maior homogeneidade nos erros. No teste com esta arquitetura de rede neural, com a função construída utilizando os 50 complexos proteína- ligante no treinamento e os mesmos, no teste, observamos que 66% dos complexos tiveram uma diferença menor que 1,0 dos valores observados em relação aos esperados. O erro de generalização, obtido por VCG, de uma rede neural utilizando 9 neurônios na camada oculta foi cerca de dez vezes menor ao obtido utilizando uma função polinomial. Isto é um indicativo da superioridade da metodologia de rede neural, com relação a metodologia de regressão multivariada, principalmente em uma função empírica desenvolvida para estimar afinidades relativas à uma ampla gama de complexos receptor-ligante.Made available in DSpace on 2015-03-04T18:51:07Z (GMT). No. of bitstreams: 1 dissertacao final.pdf: 1262049 bytes, checksum: 07f1ebc0954cc7f649c905b7d60adcf7 (MD5) Previous issue date: 2008-08-25Fundação Carlos Chagas Filho de Amparo a Pesquisa do Estado do Rio de Janeiroapplication/pdfhttp://tede-server.lncc.br:8080/retrieve/490/dissertacao%20final.pdf.jpghttp://tede-server.lncc.br:8080/retrieve/705/dissertacao%20final.pdf.jpgporLaboratório Nacional de Computação CientíficaPrograma de Pós-Graduação em Modelagem ComputacionalLNCCBRServiço de Análise e Apoio a Formação de Recursos HumanosBiomoléculas - Estrutura - Simulação por computadorproteínas - Estrutura - Simulação por computadorAtracamento molecularDesenvolvimento de fármacosEnergia livreCNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAOConstrução de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-LiganteConstruction of Empirical Scoring Functions Using Artificial Neural Network for Determination of Affinites Constants Between Receptor-Ligandinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações do LNCCinstname:Laboratório Nacional de Computação Científica (LNCC)instacron:LNCCORIGINALdissertacao final.pdfapplication/pdf1262049http://tede-server.lncc.br:8080/tede/bitstream/tede/101/1/dissertacao+final.pdf07f1ebc0954cc7f649c905b7d60adcf7MD51THUMBNAILdissertacao final.pdf.jpgdissertacao final.pdf.jpgimage/jpeg3587http://tede-server.lncc.br:8080/tede/bitstream/tede/101/2/dissertacao+final.pdf.jpg5267d219141e4fdfcfca2da11f90ce40MD52tede/1012018-07-04 09:59:44.413oai:tede-server.lncc.br:tede/101Biblioteca Digital de Teses e Dissertaçõeshttps://tede.lncc.br/PUBhttps://tede.lncc.br/oai/requestlibrary@lncc.br\|\|library@lncc.bropendoar:2018-07-04T12:59:44Biblioteca Digital de Teses e Dissertações do LNCC - Laboratório Nacional de Computação Científica (LNCC)false
dc.title.por.fl_str_mv	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
dc.title.alternative.eng.fl_str_mv	Construction of Empirical Scoring Functions Using Artificial Neural Network for Determination of Affinites Constants Between Receptor-Ligand
title	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
spellingShingle	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante Rêgo, Thais Gaudencio do Biomoléculas - Estrutura - Simulação por computador proteínas - Estrutura - Simulação por computador Atracamento molecular Desenvolvimento de fármacos Energia livre CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
title_full	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
title_fullStr	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
title_full_unstemmed	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
title_sort	Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante
author	Rêgo, Thais Gaudencio do
author_facet	Rêgo, Thais Gaudencio do
author_role	author
dc.contributor.advisor1.fl_str_mv	Dardenne, Laurent Emmanuel
dc.contributor.advisor1ID.fl_str_mv	CPF:49809431104
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/8344194525615133
dc.contributor.advisor-co1.fl_str_mv	Barbosa, Helio José Corrêa
dc.contributor.advisor-co1ID.fl_str_mv	CPF:194 306 716 34
dc.contributor.advisor-co1Lattes.fl_str_mv	http://lattes.cnpq.br/0375745110240885
dc.contributor.referee1.fl_str_mv	Raupp, Fernanda Maria Pereira
dc.contributor.referee1ID.fl_str_mv	CPF:00000000111
dc.contributor.referee1Lattes.fl_str_mv	http://lattes.cnpq.br/6932171005996406
dc.contributor.referee2.fl_str_mv	Sant'anna, Carlos Maurício Rabello de
dc.contributor.referee2ID.fl_str_mv	CPF:82723222772
dc.contributor.referee2Lattes.fl_str_mv	SANT\'ANNA, Carlos Maurício Rabello de
dc.contributor.authorID.fl_str_mv	CPF:012117554
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/3166390632199101
dc.contributor.author.fl_str_mv	Rêgo, Thais Gaudencio do
contributor_str_mv	Dardenne, Laurent Emmanuel Barbosa, Helio José Corrêa Raupp, Fernanda Maria Pereira Sant'anna, Carlos Maurício Rabello de
dc.subject.por.fl_str_mv	Biomoléculas - Estrutura - Simulação por computador proteínas - Estrutura - Simulação por computador Atracamento molecular Desenvolvimento de fármacos Energia livre
topic	Biomoléculas - Estrutura - Simulação por computador proteínas - Estrutura - Simulação por computador Atracamento molecular Desenvolvimento de fármacos Energia livre CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.cnpq.fl_str_mv	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	The understanding of receptor-ligand molecular recognition is one of the central aspects in structure-based design and discovery of new drugs. The key methodology is the docking of small molecules in active sites of proteins. There are two aims in any program of molecular docking: the search for the best ligand-protein conformation and the calculation of the free energy of this association, or its affinity constant. The test set used in this work was composed by 50 protein- ligand complexes, with experimentally measured Ki or Kd values for the construction of an empirical function specific to the DOCKTHOR program, using as input variables: energy of electrostatic interaction and Lennard-Jones, contact area of ligand-receptor on the surface accessible to the solvent, the presence of hydrogen bridges, and the number of the ligand rotatable bonds that were frozen in the process of docking. These variables were used for the construction of two types of free energy scoring functions. The importance of each variable used as input data for the construction of those functions was rated by means of multiple regression. A neural network was x also used to try to build the best model for the calculation of the affinity constant. The DOCKTHOR program currently has a prediction power leading to r = 0.4245, which indicates the importance of improving its scoring. The function built with the multiple regression methodology used 5 input variables and had linear, quadratic, and cross-product terms leading to r = 0.7542. Using the methodology of group cross-validation (VCG), it was concluded that the best architecture for the neural network consists of 9 neurons in the hidden layer, as it has the smallest error of generalization and greater consistency in errors. In the tests with this neural network architecture built using the same 50 protein-ligand complexes in training and test, 66% of the complexes had a difference smaller than 1.0 in the observed values. The generalization error (obtained by VCG) of a neural network that uses 9 neurons in the hidden layer was about ten times lower than that obtained by using a polynomial function. This is an indication of the superiority of the neural network methodology with respect to the multivariate regression methodology, specially for an empirical function developed for a broad range of receptor-ligand complexes.
publishDate	2008
dc.date.issued.fl_str_mv	2008-08-25
dc.date.available.fl_str_mv	2009-06-18
dc.date.accessioned.fl_str_mv	2015-03-04T18:51:07Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	RÊGO, Thais Gaudencio do. Construction of Empirical Scoring Functions Using Artificial Neural Network for Determination of Affinites Constants Between Receptor-Ligand. 2008. 162 f. Dissertação (Mestrado em Modelagem computacional) - Laboratório Nacional de Computação Científica, Petrópolis, 2008.
dc.identifier.uri.fl_str_mv	https://tede.lncc.br/handle/tede/101
identifier_str_mv	RÊGO, Thais Gaudencio do. Construction of Empirical Scoring Functions Using Artificial Neural Network for Determination of Affinites Constants Between Receptor-Ligand. 2008. 162 f. Dissertação (Mestrado em Modelagem computacional) - Laboratório Nacional de Computação Científica, Petrópolis, 2008.
url	https://tede.lncc.br/handle/tede/101
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Laboratório Nacional de Computação Científica
dc.publisher.program.fl_str_mv	Programa de Pós-Graduação em Modelagem Computacional
dc.publisher.initials.fl_str_mv	LNCC
dc.publisher.country.fl_str_mv	BR
dc.publisher.department.fl_str_mv	Serviço de Análise e Apoio a Formação de Recursos Humanos
publisher.none.fl_str_mv	Laboratório Nacional de Computação Científica
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações do LNCC instname:Laboratório Nacional de Computação Científica (LNCC) instacron:LNCC
instname_str	Laboratório Nacional de Computação Científica (LNCC)
instacron_str	LNCC
institution	LNCC
reponame_str	Biblioteca Digital de Teses e Dissertações do LNCC
collection	Biblioteca Digital de Teses e Dissertações do LNCC
bitstream.url.fl_str_mv	http://tede-server.lncc.br:8080/tede/bitstream/tede/101/1/dissertacao+final.pdf http://tede-server.lncc.br:8080/tede/bitstream/tede/101/2/dissertacao+final.pdf.jpg
bitstream.checksum.fl_str_mv	07f1ebc0954cc7f649c905b7d60adcf7 5267d219141e4fdfcfca2da11f90ce40
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações do LNCC - Laboratório Nacional de Computação Científica (LNCC)
repository.mail.fl_str_mv	library@lncc.br\|\|library@lncc.br
_version_	1797689458913443840

Construção de Funções Empíricas Utilizando Rede Neural para Determinação de Constantes de Afinidade Receptor-Ligante

Registros relacionados