Revelando as relações entre métodos explicativos e desempenho de agentes de aprendizado por reforço

Santos, Alexandre Magno Monteiro

Bibliographic details
Year of defense:	2024
Main author:	Santos, Alexandre Magno Monteiro
Advisor:	Cavalcante Neto, Joaquim Bento
Examination committee:	Not provided by the institution
Document type:	Master's thesis
Access type:	Open access
Language:	por
Defense institution:	Not provided by the institution
Graduate program:	Not provided by the institution
Department:	Not provided by the institution
Country:	Not provided by the institution
CNPq knowledge area:	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Access link:	http://repositorio.ufc.br/handle/riufc/81619
Abstract:	The advancement of technology and artificial intelligence has driven the use of reinforcement learning algorithms in areas such as education and video games. Despite the success of these applications, it remains unclear which factors agents learn that ensure their high performance. Understanding how these learned features affect results is essential for increasing user confidence and identifying failures, especially in critical applications. This work investigates how the performance and robustness of machine learning models influence the features that agents observe when solving problems. Using Explainable Artificial Intelligence methods, the study analyzes whether changes in explainability are noticeable among agents with different levels of robustness, and whether these changes can predict agents' performance in environments outside the training set. Two experiments were carried out. In the first, agents with different levels of robustness were analyzed using Grad-CAM, Integrated Gradients, SHAP, and LIME to generate explanations, and the impact of robustness on the agents' explainability was evaluated through qualitative and quantitative analyses. In the second, variations in models and scenarios were compared to identify similarities and correlations with the rewards obtained by each agent, in order to understand the relationship between an agent's performance and the features it observes. The results show significant variations in explainability among agents with different levels of robustness, and these variations can be used to predict model performance in unknown environments. Agent performance is also correlated with the similarity of explanations: agents whose explanations resemble those of successful models are more likely to achieve high performance. The work concludes that the robustness and performance of an agent are linked to the features it learns for solving a problem.
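The second experiment described in the abstract relates the similarity of explanations to the rewards agents obtain. The sketch below illustrates one way such a comparison could be set up. It is not the thesis's implementation: this record does not say which similarity measure or correlation statistic was used, so cosine similarity and Pearson correlation are assumptions, and every function and variable name is hypothetical. Attribution maps are assumed to be arrays already produced by some explanation method.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine similarity between two attribution maps (assumed measure)."""
    a = np.asarray(a, dtype=float).ravel()
    b = np.asarray(b, dtype=float).ravel()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    # An all-zero map carries no direction, so treat it as dissimilar.
    return float(a @ b / denom) if denom > 0 else 0.0


def pearson(x, y):
    """Pearson correlation coefficient between two 1-D sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.corrcoef(x, y)[0, 1])


def similarity_reward_correlation(reference_map, agent_maps, agent_rewards):
    """Compare each agent's map with a reference (successful) agent's map,
    then correlate those similarities with the agents' rewards."""
    sims = [cosine_similarity(reference_map, m) for m in agent_maps]
    return sims, pearson(sims, agent_rewards)


# Toy data, invented purely for illustration (not results from the thesis).
reference = np.array([[1.0, 0.0], [0.0, 0.0]])
maps = [
    np.array([[1.0, 0.0], [0.0, 0.0]]),  # same focus as the reference
    np.array([[1.0, 1.0], [0.0, 0.0]]),  # partially overlapping focus
    np.array([[0.0, 1.0], [0.0, 0.0]]),  # disjoint focus
]
rewards = [10.0, 7.0, 0.0]

sims, r = similarity_reward_correlation(reference, maps, rewards)
print(sims, r)
```

A strongly positive coefficient on real data would be consistent with the abstract's claim that agents whose explanations resemble those of successful models tend to perform better. With only a handful of agents, such a coefficient should be read with caution.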

Item metadata

id	UFC-7_86fed649fdd4cb691eb8c3b543d3ba4c
oai_identifier_str	oai:repositorio.ufc.br:riufc/81619
network_acronym_str	UFC-7
network_name_str	Repositório Institucional da Universidade Federal do Ceará (UFC)
repository_id_str
dc.title.pt_BR.fl_str_mv	Revelando as relações entre métodos explicativos e desempenho de agentes de aprendizado por reforço
dc.title.en.pt_BR.fl_str_mv	Unveiling the relationships between explainability methods and reinforcement learning agent performance
author	Santos, Alexandre Magno Monteiro
author_facet	Santos, Alexandre Magno Monteiro
author_role	author
dc.contributor.co-advisor.none.fl_str_mv	Nogueira, Yuri Lenon Barbosa
dc.contributor.author.fl_str_mv	Santos, Alexandre Magno Monteiro
dc.contributor.advisor1.fl_str_mv	Cavalcante Neto, Joaquim Bento
contributor_str_mv	Cavalcante Neto, Joaquim Bento
dc.subject.cnpq.fl_str_mv	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
topic	CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO; Algoritmos de aprendizado por reforço; Inteligência artificial explicável; Desempenho e robustez de agentes; Reinforcement learning algorithms; Explainable artificial intelligence; Agent performance and robustness
dc.subject.ptbr.pt_BR.fl_str_mv	Algoritmos de aprendizado por reforço; Inteligência artificial explicável; Desempenho e robustez de agentes
dc.subject.en.pt_BR.fl_str_mv	Reinforcement learning algorithms; Explainable artificial intelligence; Agent performance and robustness
publishDate	2024
dc.date.issued.fl_str_mv	2024
dc.date.accessioned.fl_str_mv	2025-07-17T16:30:40Z
dc.date.available.fl_str_mv	2025-07-17T16:30:40Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	SANTOS, Alexandre Magno Monteiro. Revelando as relações entre métodos explicativos e desempenho de agentes de aprendizado por reforço. 2024. 73 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal do Ceará, Fortaleza, 2024.
dc.identifier.uri.fl_str_mv	http://repositorio.ufc.br/handle/riufc/81619
identifier_str_mv	SANTOS, Alexandre Magno Monteiro. Revelando as relações entre métodos explicativos e desempenho de agentes de aprendizado por reforço. 2024. 73 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal do Ceará, Fortaleza, 2024.
url	http://repositorio.ufc.br/handle/riufc/81619
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	reponame:Repositório Institucional da Universidade Federal do Ceará (UFC) instname:Universidade Federal do Ceará (UFC) instacron:UFC
instname_str	Universidade Federal do Ceará (UFC)
instacron_str	UFC
institution	UFC
reponame_str	Repositório Institucional da Universidade Federal do Ceará (UFC)
collection	Repositório Institucional da Universidade Federal do Ceará (UFC)
bitstream.url.fl_str_mv	http://repositorio.ufc.br/bitstream/riufc/81619/1/2024_dis_ammsantos.pdf http://repositorio.ufc.br/bitstream/riufc/81619/2/license.txt
bitstream.checksum.fl_str_mv	b9f0de464cb53e3c2eb8777ce3787988 8a4605be74aa9ea9d79846c1fba20a33
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)
repository.mail.fl_str_mv	bu@ufc.br || repositorio@ufc.br
_version_	1847793224271265792
