CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Maciel, Allan James Ferreira

Bibliographic details
Defense year:	2012
Main author:	Maciel, Allan James Ferreira
Advisor:	FONSECA NETO, João Viana da
Examination committee:	Serra, Ginalber Luiz de Oliveira
Document type:	Master's thesis
Access type:	Open access
Language:	por
Degree-granting institution:	Universidade Federal do Maranhão
Graduate program:	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET
Department:	Engenharia
Country:	BR
Keywords in Portuguese:	Programação Dinâmica Heurística; Controle Multivariável; Controle Ótimo; Regulador Quadrático Linear Discreto; Mínimos Quadrados Recursivos; Controle Digital
Keywords in English:	Heuristic Dynamic Programming; Multivariable Control; Optimal Control; Discrete Linear Quadratic Regulator; Recursive Least Squares; Digital Control
CNPq knowledge area:	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Access link:	http://tedebc.ufma.br:8080/jspui/handle/tede/494
Abstract:	Combining optimal control and dynamic programming methodologies has stimulated the development of algorithms for realizing discrete-time control systems of the linear quadratic regulator type (DLQR). The methodology used in this work is based on reinforcement learning methods built on temporal differences and approximate dynamic programming. The proposed method combines value function approximation via the RLS (recursive least squares) method with approximate policy iteration in heuristic dynamic programming (HDP) schemes. The approach is directed at assessing the convergence of the DLQR solution and at the heuristic tuning of the weighting matrices Q and R of the utility function associated with the DLQR. The convergence properties of the RLS estimator related to consistency, persistent excitation, and bias are investigated. The methodology covers the online design of DLQR controllers and is evaluated on a fourth-order multivariable dynamic system.

Item metadata

id	UFMA_ff5f0924c162ca83136cdd7dbfba3001
oai_identifier_str	oai:tede2:tede/494
network_acronym_str	UFMA
network_name_str	Biblioteca Digital de Teses e Dissertações da UFMA
repository_id_str
spelling	FONSECA NETO, João Viana da; CPF:2199749048; http://lattes.cnpq.br/0029055473709795; Serra, Ginalber Luiz de Oliveira; CPF:79248934315; http://lattes.cnpq.br/0831092299374520; CPF:00304277380; http://lattes.cnpq.br/9294927489743146; Maciel, Allan James Ferreira; 2016-08-17T14:53:22Z; 2013-04-03; 2012-09-28; MACIEL, Allan James Ferreira. CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING. 2012. 121 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luís, 2012.; http://tedebc.ufma.br:8080/jspui/handle/tede/494; Combining optimal control and dynamic programming methodologies has stimulated the development of algorithms for realizing discrete-time control systems of the linear quadratic regulator type (DLQR). The methodology used in this work is based on reinforcement learning methods built on temporal differences and approximate dynamic programming. The proposed method combines value function approximation via the RLS (recursive least squares) method with approximate policy iteration in heuristic dynamic programming (HDP) schemes. The approach is directed at assessing the convergence of the DLQR solution and at the heuristic tuning of the weighting matrices Q and R of the utility function associated with the DLQR. The convergence properties of the RLS estimator related to consistency, persistent excitation, and bias are investigated. The methodology covers the online design of DLQR controllers and is evaluated on a fourth-order multivariable dynamic system.; Made available in DSpace on 2016-08-17T14:53:22Z (GMT). No. of bitstreams: 1 Dissertacao Allan James.pdf: 3170694 bytes, checksum: 054a9e74e81a7c2099800246d0b6c530 (MD5) Previous issue date: 2012-09-28; Coordenação de Aperfeiçoamento de Pessoal de Nível Superior; application/pdf; por; Universidade Federal do Maranhão; PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET; UFMA; BR; Engenharia; Programação Dinâmica Heurística; Controle Multivariável; Controle Ótimo; Regulador Quadrático Linear Discreto; Mínimos Quadrados Recursivos; Controle Digital; Heuristic Dynamic Programming; Multivariable Control; Optimal Control; Discrete Linear Quadratic Regulator; Recursive Least Squares; Digital Control; CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO; CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA; CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING; info:eu-repo/semantics/publishedVersion; info:eu-repo/semantics/masterThesis; info:eu-repo/semantics/openAccess; reponame:Biblioteca Digital de Teses e Dissertações da UFMA; instname:Universidade Federal do Maranhão (UFMA); instacron:UFMA; ORIGINAL; Dissertacao Allan James.pdf; application/pdf; 3170694; http://tedebc.ufma.br:8080/bitstream/tede/494/1/Dissertacao+Allan+James.pdf; 054a9e74e81a7c2099800246d0b6c530; MD5; 1; tede/494; 2018-01-26 18:07:07.541; oai:tede2:tede/494; Biblioteca Digital de Teses e Dissertações; https://tedebc.ufma.br/jspui/; PUB; http://tedebc.ufma.br:8080/oai/request; repositorio@ufma.br||repositorio@ufma.br; opendoar:2131; 2018-01-26T21:07:07; Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA); false
dc.title.por.fl_str_mv	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
dc.title.alternative.eng.fl_str_mv	CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING
title	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
spellingShingle	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA; Maciel, Allan James Ferreira; Programação Dinâmica Heurística; Controle Multivariável; Controle Ótimo; Regulador Quadrático Linear Discreto; Mínimos Quadrados Recursivos; Controle Digital; Heuristic Dynamic Programming; Multivariable Control; Optimal Control; Discrete Linear Quadratic Regulator; Recursive Least Squares; Digital Control; CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_full	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_fullStr	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_full_unstemmed	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_sort	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
author	Maciel, Allan James Ferreira
author_facet	Maciel, Allan James Ferreira
author_role	author
dc.contributor.advisor1.fl_str_mv	FONSECA NETO, João Viana da
dc.contributor.advisor1ID.fl_str_mv	CPF:2199749048
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/0029055473709795
dc.contributor.referee1.fl_str_mv	Serra, Ginalber Luiz de Oliveira
dc.contributor.referee1ID.fl_str_mv	CPF:79248934315
dc.contributor.referee1Lattes.fl_str_mv	http://lattes.cnpq.br/0831092299374520
dc.contributor.authorID.fl_str_mv	CPF:00304277380
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/9294927489743146
dc.contributor.author.fl_str_mv	Maciel, Allan James Ferreira
contributor_str_mv	FONSECA NETO, João Viana da; Serra, Ginalber Luiz de Oliveira
dc.subject.por.fl_str_mv	Programação Dinâmica Heurística; Controle Multivariável; Controle Ótimo; Regulador Quadrático Linear Discreto; Mínimos Quadrados Recursivos; Controle Digital
topic	Programação Dinâmica Heurística; Controle Multivariável; Controle Ótimo; Regulador Quadrático Linear Discreto; Mínimos Quadrados Recursivos; Controle Digital; Heuristic Dynamic Programming; Multivariable Control; Optimal Control; Discrete Linear Quadratic Regulator; Recursive Least Squares; Digital Control; CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.eng.fl_str_mv	Heuristic Dynamic Programming; Multivariable Control; Optimal Control; Discrete Linear Quadratic Regulator; Recursive Least Squares; Digital Control
dc.subject.cnpq.fl_str_mv	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	Combining optimal control and dynamic programming methodologies has stimulated the development of algorithms for realizing discrete-time control systems of the linear quadratic regulator type (DLQR). The methodology used in this work is based on reinforcement learning methods built on temporal differences and approximate dynamic programming. The proposed method combines value function approximation via the RLS (recursive least squares) method with approximate policy iteration in heuristic dynamic programming (HDP) schemes. The approach is directed at assessing the convergence of the DLQR solution and at the heuristic tuning of the weighting matrices Q and R of the utility function associated with the DLQR. The convergence properties of the RLS estimator related to consistency, persistent excitation, and bias are investigated. The methodology covers the online design of DLQR controllers and is evaluated on a fourth-order multivariable dynamic system.
publishDate	2012
dc.date.issued.fl_str_mv	2012-09-28
dc.date.available.fl_str_mv	2013-04-03
dc.date.accessioned.fl_str_mv	2016-08-17T14:53:22Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	MACIEL, Allan James Ferreira. CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING. 2012. 121 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luís, 2012.
dc.identifier.uri.fl_str_mv	http://tedebc.ufma.br:8080/jspui/handle/tede/494
identifier_str_mv	MACIEL, Allan James Ferreira. CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING. 2012. 121 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luís, 2012.
url	http://tedebc.ufma.br:8080/jspui/handle/tede/494
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal do Maranhão
dc.publisher.program.fl_str_mv	PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET
dc.publisher.initials.fl_str_mv	UFMA
dc.publisher.country.fl_str_mv	BR
dc.publisher.department.fl_str_mv	Engenharia
publisher.none.fl_str_mv	Universidade Federal do Maranhão
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da UFMA instname:Universidade Federal do Maranhão (UFMA) instacron:UFMA
instname_str	Universidade Federal do Maranhão (UFMA)
instacron_str	UFMA
institution	UFMA
reponame_str	Biblioteca Digital de Teses e Dissertações da UFMA
collection	Biblioteca Digital de Teses e Dissertações da UFMA
bitstream.url.fl_str_mv	http://tedebc.ufma.br:8080/bitstream/tede/494/1/Dissertacao+Allan+James.pdf
bitstream.checksum.fl_str_mv	054a9e74e81a7c2099800246d0b6c530
bitstream.checksumAlgorithm.fl_str_mv	MD5
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA)
repository.mail.fl_str_mv	repositorio@ufma.br||repositorio@ufma.br
_version_	1853507978516234240
