Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço

DANTAS, Diego de Oliveira

Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço

Detalhes bibliográficos
Ano de defesa:	2017
Autor(a) principal:	DANTAS, Diego de Oliveira
Orientador(a):	SANTOS, Sérgio Ronaldo Barros dos
Banca de defesa:	Não Informado pela instituição
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Universidade Federal do Maranhão
Programa de Pós-Graduação:	PROGRAMA DE PÓS-GRADUAÇÃO EM CIÊNCIA DA COMPUTAÇÃO/CCET
Departamento:	DEPARTAMENTO DE INFORMÁTICA/CCET
País:	Brasil
Palavras-chave em Português:	Robótica Aprendizado por reforço Learning Automata Construção autônoma Robotics Reinforcement learning Autonomous construction
Área do conhecimento CNPq:	Engenharia de Software
Link de acesso:	https://tedebc.ufma.br/jspui/handle/tede/2062
Resumo:	This work presents the development and implementation of an autonomous construction system in which uses a terrestrial mobile robot for constructing three-dimensional structures from blocks of different size. A high level planning is proposed to generate the construction plans of the structures. This algorithm is based on Reinforcement Learning methods called Finite Action-Set Learning Automata (FALA) and Parameterized Learnig Automata (PLA). From this planner, the used types of blocks for the construction and the final composition of the structure is defined by the user. The high level planner is used to solve the following problems: 1) Generate an optimal assembly diagram, which consists of a list of positions, orientations and kind of blocks final, taken into account the design of the structure defined by the user. The minimal number of blocks and also the restriction of assembly is considered during the generation of the diagram; 2) Generate an optimal execution plan that can be used by the robot to accomplish the task of assembly. This plan is composed by the sequence of procedures for manipulating and assembling blocks. The trajectories generated by the global planner based on A* algorithm is used to accomplish the execution plan. After completion the execution plan, the global planner sends a series of positions to a path tracking controller, called eband local planner. This tracking controller is used to control the robot during it navigation through simulated or actual environment. The mapping of the simulated and real environments and the location of the robot in the environment is performed using the algorithm called Real-Time Appearance-Based Mapping (RTAB-Map). The RTAB-Map uses image and odometry information to generate the environment also estimate the position of the robot in relation to the global coordinate system. The simulated and actual robots use the framework called Robot Operation System (ROS). The ROS allows the communication between different applications even if they are performed in different machines. To demonstrate the efficiency of the obtained solutions using the high level planner, simulated and experimental tests of the autonomous construction system are performed. During these tests, different types of structure (tower, containment wall, space station and pyramid) are assembled. The results show that the reinforcement learning method is able to feasible assembly diagrams and execution plans (sequence of procedures) can be used to perform the task in a short period of time.

Metadados do item

id	UFMA_524df2f23a2997e769524eeeb68f1154
oai_identifier_str	oai:tede2:tede/2062
network_acronym_str	UFMA
network_name_str	Biblioteca Digital de Teses e Dissertações da UFMA
repository_id_str
spelling	SANTOS, Sérgio Ronaldo Barros doshttp://lattes.cnpq.br/0608523738367987027.404.933-39http://lattes.cnpq.br/9622980317962859DANTAS, Diego de Oliveira2017-12-28T19:16:25Z2017-08-28DANTAS, Diego de Oliveira. Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço. 2017. 115 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal do Maranhão, São Luís, 2017.https://tedebc.ufma.br/jspui/handle/tede/2062This work presents the development and implementation of an autonomous construction system in which uses a terrestrial mobile robot for constructing three-dimensional structures from blocks of different size. A high level planning is proposed to generate the construction plans of the structures. This algorithm is based on Reinforcement Learning methods called Finite Action-Set Learning Automata (FALA) and Parameterized Learnig Automata (PLA). From this planner, the used types of blocks for the construction and the final composition of the structure is defined by the user. The high level planner is used to solve the following problems: 1) Generate an optimal assembly diagram, which consists of a list of positions, orientations and kind of blocks final, taken into account the design of the structure defined by the user. The minimal number of blocks and also the restriction of assembly is considered during the generation of the diagram; 2) Generate an optimal execution plan that can be used by the robot to accomplish the task of assembly. This plan is composed by the sequence of procedures for manipulating and assembling blocks. The trajectories generated by the global planner based on A* algorithm is used to accomplish the execution plan. After completion the execution plan, the global planner sends a series of positions to a path tracking controller, called eband local planner. This tracking controller is used to control the robot during it navigation through simulated or actual environment. The mapping of the simulated and real environments and the location of the robot in the environment is performed using the algorithm called Real-Time Appearance-Based Mapping (RTAB-Map). The RTAB-Map uses image and odometry information to generate the environment also estimate the position of the robot in relation to the global coordinate system. The simulated and actual robots use the framework called Robot Operation System (ROS). The ROS allows the communication between different applications even if they are performed in different machines. To demonstrate the efficiency of the obtained solutions using the high level planner, simulated and experimental tests of the autonomous construction system are performed. During these tests, different types of structure (tower, containment wall, space station and pyramid) are assembled. The results show that the reinforcement learning method is able to feasible assembly diagrams and execution plans (sequence of procedures) can be used to perform the task in a short period of time.Este trabalho apresenta o desenvolvimento e a implementação de um sistema de construção autônomo, no qual utiliza um robô móvel terrestre para construir estruturas tridimensionais a partir de blocos de diferentes tamanhos. Um planejamento de alto nível é proposto para gerar os planos de construção das estruturas. Esse algoritmo é baseado nos métodos de Aprendizado por Reforço, denominados de Finite Action-Set Learning Automata (FALA) e Parameterized Learnig Automata (PLA). A partir desse planejador, o usuário define os tipos de blocos empregados na construção e a forma final da estrutura. O planejador de alto nível é usado para resolver os seguintes problemas: 1) Gerar um diagrama ótimo de montagem, que consiste em uma lista de posições, orientações e tipos de blocos, respeitando a forma final especificada pelo usuário. Esse diagrama é gerado considerando a minimização da quantidade de blocos usados e obedecendo as restrições quanto ao posicionamento dos blocos; 2) Gerar um plano ótimo de execução que é usado pelo robô para realizar a tarefa de montagem da estrutura. Esse plano consiste em definir a sequência de procedimentos para a manipulação e para a montagem dos blocos. As trajetórias usadas para a realização do plano de execução são geradas por um planejador global composto pelo algoritmo A*. Ao finalizar o planejamento, o planejador global envia uma série de posições para um controlador de rastreamento de trajetória, chamado de eband local planner. Esse controlador de trajetória é usado para controlar a base móvel do robô durante sua navegação através do ambiente simulado ou real. O mapeamento do ambiente simulado e real e a localização do robô nesses ambientes é realizada através do algoritmo chamado de Real-Time Appearance-Based Mapping (RTAB-Map). O RTAB-Map usa informações de imagem e de odometria das rodas do robô para gerar o mapa e estimar a posição do robô em relação ao sistema de coordenadas global. Os robôs simulado e real utilizam os recursos do framework denominado de Robot Operation System (ROS). O ROS permite que diferentes aplicações comuniquem-se entre si, mesmo quando executadas em máquinas diferentes. Para demonstrar a eficiência das soluções obtidas pelo planejador de alto nível são realizados testes simulados e experimentais do sistema de construção autônomo. Durante esses testes são montadas diferentes tipos de estrutura (Torre, Barragem, Estação Espacial e Pirâmide). Os resultados mostram que o método de aprendizado por reforço é capaz de gerar diagramas de montagem e planos de execução (sequência de procedimentos) factíveis para a realização da tarefa em menor tempo possível.Submitted by Rosivalda Pereira (mrs.pereira@ufma.br) on 2017-12-28T19:16:25Z No. of bitstreams: 1 DiegoOliveiraDantas.pdf: 7136883 bytes, checksum: ea5ae117a8feb9cbf48a7497f2eca50b (MD5)Made available in DSpace on 2017-12-28T19:16:25Z (GMT). No. of bitstreams: 1 DiegoOliveiraDantas.pdf: 7136883 bytes, checksum: ea5ae117a8feb9cbf48a7497f2eca50b (MD5) Previous issue date: 2017-08-28CAPESapplication/pdfporUniversidade Federal do MaranhãoPROGRAMA DE PÓS-GRADUAÇÃO EM CIÊNCIA DA COMPUTAÇÃO/CCETUFMABrasilDEPARTAMENTO DE INFORMÁTICA/CCETRobóticaAprendizado por reforçoLearning AutomataConstrução autônomaRoboticsReinforcement learningAutonomous constructionEngenharia de SoftwareImplementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por ReforçoImplementation of an Autonomous System of Construção de Structure using Learned by Reforçoinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da UFMAinstname:Universidade Federal do Maranhão (UFMA)instacron:UFMAORIGINALDiegoOliveiraDantas.pdfDiegoOliveiraDantas.pdfapplication/pdf7136883http://tedebc.ufma.br:8080/bitstream/tede/2062/2/DiegoOliveiraDantas.pdfea5ae117a8feb9cbf48a7497f2eca50bMD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82255http://tedebc.ufma.br:8080/bitstream/tede/2062/1/license.txt97eeade1fce43278e63fe063657f8083MD51tede/20622017-12-28 16:16:26.09oai:tede2:tede/2062IExJQ0VOw4dBIERFIERJU1RSSUJVScOHw4NPIE7Dg08tRVhDTFVTSVZBCgpDb20gYSBhcHJlc2VudGHDp8OjbyBkZXN0YSBsaWNlbsOnYSxvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvciBjb25jZWRlIMOgIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRvIE1hcmFuaMOjbyAoVUZNQSkgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsIHRyYWR1emlyIChjb25mb3JtZSBkZWZpbmlkbyBhYmFpeG8pLCBlL291IGRpc3RyaWJ1aXIgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyw7RuaWNvIGUgZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBjb25jb3JkYSBxdWUgYSBVRk1BIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhIFVGTUEgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gcGFyYSBmaW5zIGRlIHNlZ3VyYW7Dp2EsIGJhY2stdXAgZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogZGVjbGFyYSBxdWUgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIMOpIG9yaWdpbmFsIGUgcXVlIHZvY8OqIHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIG8gZGVww7NzaXRvIGRhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gbsOjbywgcXVlIHNlamEgZGUgc2V1IGNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiBkZWNsYXJhIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgw6AgVUZNQSBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBURVNFIE9VIERJU1NFUlRBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PIFFVRSBOw4NPIFNFSkEgQSBVRk1BLCBWT0PDiiBERUNMQVJBIFFVRSBSRVNQRUlUT1UgVE9ET1MgRSBRVUFJU1FVRVIgRElSRUlUT1MgREUgUkVWSVPDg08gQ09NTyBUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCkEgVUZNQSBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lIG91IG8ocykgbm9tZShzKSBkbyhzKSBkZXRlbnRvcihlcykgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbywgZSBuw6NvIGZhcsOhIHF1YWxxdWVyIGFsdGVyYcOnw6NvLCBhbMOpbSBkYXF1ZWxhcyBjb25jZWRpZGFzIHBvciBlc3RhIGxpY2Vuw6dhLgoKRGVjbGFyYSB0YW1iw6ltIHF1ZSB0b2RhcyBhcyBhZmlsaWHDp8O1ZXMgY29ycG9yYXRpdmFzIG91IGluc3RpdHVjaW9uYWlzIGUgdG9kYXMgYXMgZm9udGVzIGRlIGFwb2lvIGZpbmFuY2Vpcm8gYW8gdHJhYmFsaG8gZXN0w6NvIGRldmlkYW1lbnRlIGNpdGFkYXMgb3UgbWVuY2lvbmFkYXMgZSBjZXJ0aWZpY2EgcXVlIG7Do28gaMOhIG5lbmh1bSBpbnRlcmVzc2UgY29tZXJjaWFsIG91IGFzc29jaWF0aXZvIHF1ZSByZXByZXNlbnRlIGNvbmZsaXRvIGRlIGludGVyZXNzZSBlbSBjb25leMOjbyBjb20gbyB0cmFiYWxobyBzdWJtZXRpZG8uCgoKCgoKCgo=Biblioteca Digital de Teses e Dissertaçõeshttps://tedebc.ufma.br/jspui/PUBhttp://tedebc.ufma.br:8080/oai/requestrepositorio@ufma.br\|\|repositorio@ufma.bropendoar:21312017-12-28T19:16:26Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA)false
dc.title.por.fl_str_mv	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
dc.title.alternative.eng.fl_str_mv	Implementation of an Autonomous System of Construção de Structure using Learned by Reforço
title	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
spellingShingle	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço DANTAS, Diego de Oliveira Robótica Aprendizado por reforço Learning Automata Construção autônoma Robotics Reinforcement learning Autonomous construction Engenharia de Software
title_short	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
title_full	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
title_fullStr	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
title_full_unstemmed	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
title_sort	Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço
author	DANTAS, Diego de Oliveira
author_facet	DANTAS, Diego de Oliveira
author_role	author
dc.contributor.advisor1.fl_str_mv	SANTOS, Sérgio Ronaldo Barros dos
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/0608523738367987
dc.contributor.authorID.fl_str_mv	027.404.933-39
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/9622980317962859
dc.contributor.author.fl_str_mv	DANTAS, Diego de Oliveira
contributor_str_mv	SANTOS, Sérgio Ronaldo Barros dos
dc.subject.por.fl_str_mv	Robótica Aprendizado por reforço Learning Automata Construção autônoma Robotics Reinforcement learning Autonomous construction
topic	Robótica Aprendizado por reforço Learning Automata Construção autônoma Robotics Reinforcement learning Autonomous construction Engenharia de Software
dc.subject.cnpq.fl_str_mv	Engenharia de Software
description	This work presents the development and implementation of an autonomous construction system in which uses a terrestrial mobile robot for constructing three-dimensional structures from blocks of different size. A high level planning is proposed to generate the construction plans of the structures. This algorithm is based on Reinforcement Learning methods called Finite Action-Set Learning Automata (FALA) and Parameterized Learnig Automata (PLA). From this planner, the used types of blocks for the construction and the final composition of the structure is defined by the user. The high level planner is used to solve the following problems: 1) Generate an optimal assembly diagram, which consists of a list of positions, orientations and kind of blocks final, taken into account the design of the structure defined by the user. The minimal number of blocks and also the restriction of assembly is considered during the generation of the diagram; 2) Generate an optimal execution plan that can be used by the robot to accomplish the task of assembly. This plan is composed by the sequence of procedures for manipulating and assembling blocks. The trajectories generated by the global planner based on A* algorithm is used to accomplish the execution plan. After completion the execution plan, the global planner sends a series of positions to a path tracking controller, called eband local planner. This tracking controller is used to control the robot during it navigation through simulated or actual environment. The mapping of the simulated and real environments and the location of the robot in the environment is performed using the algorithm called Real-Time Appearance-Based Mapping (RTAB-Map). The RTAB-Map uses image and odometry information to generate the environment also estimate the position of the robot in relation to the global coordinate system. The simulated and actual robots use the framework called Robot Operation System (ROS). The ROS allows the communication between different applications even if they are performed in different machines. To demonstrate the efficiency of the obtained solutions using the high level planner, simulated and experimental tests of the autonomous construction system are performed. During these tests, different types of structure (tower, containment wall, space station and pyramid) are assembled. The results show that the reinforcement learning method is able to feasible assembly diagrams and execution plans (sequence of procedures) can be used to perform the task in a short period of time.
publishDate	2017
dc.date.accessioned.fl_str_mv	2017-12-28T19:16:25Z
dc.date.issued.fl_str_mv	2017-08-28
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	DANTAS, Diego de Oliveira. Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço. 2017. 115 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal do Maranhão, São Luís, 2017.
dc.identifier.uri.fl_str_mv	https://tedebc.ufma.br/jspui/handle/tede/2062
identifier_str_mv	DANTAS, Diego de Oliveira. Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço. 2017. 115 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal do Maranhão, São Luís, 2017.
url	https://tedebc.ufma.br/jspui/handle/tede/2062
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Federal do Maranhão
dc.publisher.program.fl_str_mv	PROGRAMA DE PÓS-GRADUAÇÃO EM CIÊNCIA DA COMPUTAÇÃO/CCET
dc.publisher.initials.fl_str_mv	UFMA
dc.publisher.country.fl_str_mv	Brasil
dc.publisher.department.fl_str_mv	DEPARTAMENTO DE INFORMÁTICA/CCET
publisher.none.fl_str_mv	Universidade Federal do Maranhão
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da UFMA instname:Universidade Federal do Maranhão (UFMA) instacron:UFMA
instname_str	Universidade Federal do Maranhão (UFMA)
instacron_str	UFMA
institution	UFMA
reponame_str	Biblioteca Digital de Teses e Dissertações da UFMA
collection	Biblioteca Digital de Teses e Dissertações da UFMA
bitstream.url.fl_str_mv	http://tedebc.ufma.br:8080/bitstream/tede/2062/2/DiegoOliveiraDantas.pdf http://tedebc.ufma.br:8080/bitstream/tede/2062/1/license.txt
bitstream.checksum.fl_str_mv	ea5ae117a8feb9cbf48a7497f2eca50b 97eeade1fce43278e63fe063657f8083
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA)
repository.mail.fl_str_mv	repositorio@ufma.br\|\|repositorio@ufma.br
_version_	1853507995397259264

Implementação de um Sistema Autônomo de Construção de Estrutura usando Aprendizado por Reforço

Registros relacionados