Detecção de posicionamento do cidadão em Projetos de Lei

Maia, Dyonnatan Ferreira

Detecção de posicionamento do cidadão em Projetos de Lei

Detalhes bibliográficos
Ano de defesa:	2024
Autor(a) principal:	Maia, Dyonnatan Ferreira
Orientador(a):	Silva, Nádia Félix Felipe da
Banca de defesa:	Silva, Nádia Félix Felipe da, Pereira, Fabíola Souza Fernande, Fernandes, Deborah Silva Alves
Tipo de documento:	Dissertação
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Universidade Federal de Goiás
Programa de Pós-Graduação:	Programa de Pós-graduação em Ciência da Computação (INF)
Departamento:	Instituto de Informática - INF (RMG)
País:	Brasil
Palavras-chave em Português:	Detecção de posicionamento Aprendizado de máquina Processamento de linguagem natural Comentários políticos
Palavras-chave em Inglês:	Stance detection Machine learning Natural language processing Political comments
Área do conhecimento CNPq:	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Link de acesso:	http://repositorio.bc.ufg.br/tede/handle/tede/13338
Resumo:	Background: Comments on political projects on the internet reflect the aspirations of a significant portion of the population. The automatic stance detection of these comments regarding specific topics can help better understand public opinion. This study aims to develop a computational model with supervised learning capable of estimating the stance of comments on legislative propositions, considering the challenge of diversity and the constant emergence of new bills. Method: For the domain studied, a specific corpus was constructed by collecting comments from surveys available on the Chamber of Deputies website. The experiments included the evaluation of classic machine learning models, such as Logistic Regression, Naive Bayes, Support Vector Machine, Random Forest, and Multilayer Perceptron, in addition to the fine-tuning of BERT language models. Automatic data annotation was also performed using the zero-shot approach based on prompts from the generative GPT-3.5 model, aiming to overcome the difficulties related to human annotation and the scarcity of annotated data, generating approximately three times the size of the manually annotated corpus. Results: The results indicate that the adjusted BERTimbau model surpassed the classic approaches, achieving an average F1- score of 70.4% on unseen topics. Moreover, the application of automatically annotated data in the initial stage of BERTimbau fine-tuning resulted in performance improvement, reaching an F1-score of 73.3%. The results present deep learning models as options with positive performance for the task under the conditions of this domain. Conclusion: It was observed that the ability to generate contextualized representations, along with the number of topics and comments trained, can directly interfere with performance. This makes automatic annotation and the exploration of topic diversity with Transformer architectures, promising approaches for the task

Metadados do item

id	UFG-2_eba30f419be665312059f9da7ec390e4
oai_identifier_str	oai:repositorio.bc.ufg.br:tede/13338
network_acronym_str	UFG-2
network_name_str	Repositório Institucional da UFG
repository_id_str
spelling	Silva, Nádia Félix Felipe dahttp://lattes.cnpq.br/7864834001694765Silva, Nádia Félix Felipe daPereira, Fabíola Souza FernandeFernandes, Deborah Silva Alveshttp://lattes.cnpq.br/0742505090314413Maia, Dyonnatan Ferreira2024-09-16T18:07:03Z2024-09-16T18:07:03Z2024-03-22MAIA, D. F. Detecção de posicionamento do cidadão em Projetos de Lei. 2024. 111 f. Dissertação (Mestrado em Ciência da Computação) - Instituto de Informática, Universidade Federal de Goiás, Goiânia, 2024.http://repositorio.bc.ufg.br/tede/handle/tede/13338Background: Comments on political projects on the internet reflect the aspirations of a significant portion of the population. The automatic stance detection of these comments regarding specific topics can help better understand public opinion. This study aims to develop a computational model with supervised learning capable of estimating the stance of comments on legislative propositions, considering the challenge of diversity and the constant emergence of new bills. Method: For the domain studied, a specific corpus was constructed by collecting comments from surveys available on the Chamber of Deputies website. The experiments included the evaluation of classic machine learning models, such as Logistic Regression, Naive Bayes, Support Vector Machine, Random Forest, and Multilayer Perceptron, in addition to the fine-tuning of BERT language models. Automatic data annotation was also performed using the zero-shot approach based on prompts from the generative GPT-3.5 model, aiming to overcome the difficulties related to human annotation and the scarcity of annotated data, generating approximately three times the size of the manually annotated corpus. Results: The results indicate that the adjusted BERTimbau model surpassed the classic approaches, achieving an average F1- score of 70.4% on unseen topics. Moreover, the application of automatically annotated data in the initial stage of BERTimbau fine-tuning resulted in performance improvement, reaching an F1-score of 73.3%. The results present deep learning models as options with positive performance for the task under the conditions of this domain. Conclusion: It was observed that the ability to generate contextualized representations, along with the number of topics and comments trained, can directly interfere with performance. This makes automatic annotation and the exploration of topic diversity with Transformer architectures, promising approaches for the taskContextualização: os comentários sobre projetos políticos na internet refletem os anseios de uma parte significativa da população. A detecção automática do posicionamento desses comentários em relação a tópicos específicos pode ajudar a compreender melhor a opinião pública. Este estudo visa desenvolver um modelo computacional com aprendizado supervisionado, capaz de estimar o posicionamento de comentários sobre Proposições Legislativas (PLs), considerando o desafio da diversidade e surgimento constante de novas PLs. Método: para o domínio estudado, construiu-se um corpus específico, coletando-se comentários de enquetes disponíveis no site da Câmara dos Deputados. Os experimentos incluíram a avaliação de modelos clássicos de aprendizado de máquina, como Regressão Logística, Naive Bayes, Máquina de Vetores de Suporte (SVM), Random Forest e Per-ceptron Multicamada, além do fine-tuning de modelos de linguagem BERT. Também se realizou a anotação automática de dados com abordagem zero-shot, baseada em prompt do modelo generativo GPT-3.5, visando superar as dificuldades relacionadas à anotação humana e à escassez de dados anotados, gerando três vezes o tamanho do corpus manualmente anotado. Resultados: os resultados indicam que o modelo BERTimbau com ajuste fino superou as abordagens clássicas, alcançando F1-score de 70,4% em tópicos não vistos. Além disso, a aplicação de dados anotados automaticamente na etapa inicial do fine-tuning do BERTimbau resultou em uma melhora do desempenho, atingindo um F1-score de 73,3%. Os resultados apresentam os modelos de aprendizado profundo como opções com desempenho positivo para a tarefa nas condições deste domínio. Conclusão: observou-se que a capacidade de gerar representações contextualizadas, juntamente à quantidade de tópicos e comentários treinados, pode interferir, diretamente, no desempenho dos modelos. Tornando-se a anotação automática e a exploração da diversidade de tópicos e da arquitetura Transformer, abordagens promissoras para a tarefaporUniversidade Federal de GoiásPrograma de Pós-graduação em Ciência da Computação (INF)UFGBrasilInstituto de Informática - INF (RMG)Attribution-NonCommercial-NoDerivatives 4.0 Internationalinfo:eu-repo/semantics/openAccessDetecção de posicionamentoDetecção de posicionamentoAprendizado de máquinaProcessamento de linguagem naturalComentários políticosStance detectionMachine learningNatural language processingPolitical commentsCIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAODetecção de posicionamento do cidadão em Projetos de LeiStance detection of citizen opinions on billsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisreponame:Repositório Institucional da UFGinstname:Universidade Federal de Goiás (UFG)instacron:UFGLICENSElicense.txtlicense.txttext/plain; charset=utf-81748http://repositorio.bc.ufg.br/tede/bitstreams/92469425-f645-48df-975d-1a3d6574aaf2/download8a4605be74aa9ea9d79846c1fba20a33MD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8805http://repositorio.bc.ufg.br/tede/bitstreams/83338a10-b077-4ba4-a18c-aa97685330b6/download4460e5956bc1d1639be9ae6146a50347MD52ORIGINALDissertação - Dyonnatan Ferreira Maia - 2024.pdfDissertação - Dyonnatan Ferreira Maia - 2024.pdfapplication/pdf1978619http://repositorio.bc.ufg.br/tede/bitstreams/ea3da0e3-c34d-4984-aad8-31fbfa62385d/download126fd9491f6b1d8b63281da0713addd5MD53tede/133382024-09-16 15:07:03.176http://creativecommons.org/licenses/by-nc-nd/4.0/Attribution-NonCommercial-NoDerivatives 4.0 Internationalopen.accessoai:repositorio.bc.ufg.br:tede/13338http://repositorio.bc.ufg.br/tedeRepositório InstitucionalPUBhttps://repositorio.bc.ufg.br/tedeserver/oai/requestgrt.bc@ufg.bropendoar:oai:repositorio.bc.ufg.br:tede/12342024-09-16T18:07:03Repositório Institucional da UFG - Universidade Federal de Goiás (UFG)falseTk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=
dc.title.none.fl_str_mv	Detecção de posicionamento do cidadão em Projetos de Lei
dc.title.alternative.eng.fl_str_mv	Stance detection of citizen opinions on bills
title	Detecção de posicionamento do cidadão em Projetos de Lei
spellingShingle	Detecção de posicionamento do cidadão em Projetos de Lei Maia, Dyonnatan Ferreira Detecção de posicionamento Detecção de posicionamento Aprendizado de máquina Processamento de linguagem natural Comentários políticos Stance detection Machine learning Natural language processing Political comments CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short	Detecção de posicionamento do cidadão em Projetos de Lei
title_full	Detecção de posicionamento do cidadão em Projetos de Lei
title_fullStr	Detecção de posicionamento do cidadão em Projetos de Lei
title_full_unstemmed	Detecção de posicionamento do cidadão em Projetos de Lei
title_sort	Detecção de posicionamento do cidadão em Projetos de Lei
author	Maia, Dyonnatan Ferreira
author_facet	Maia, Dyonnatan Ferreira
author_role	author
dc.contributor.advisor1.fl_str_mv	Silva, Nádia Félix Felipe da
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/7864834001694765
dc.contributor.referee1.fl_str_mv	Silva, Nádia Félix Felipe da
dc.contributor.referee2.fl_str_mv	Pereira, Fabíola Souza Fernande
dc.contributor.referee3.fl_str_mv	Fernandes, Deborah Silva Alves
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/0742505090314413
dc.contributor.author.fl_str_mv	Maia, Dyonnatan Ferreira
contributor_str_mv	Silva, Nádia Félix Felipe da Silva, Nádia Félix Felipe da Pereira, Fabíola Souza Fernande Fernandes, Deborah Silva Alves
dc.subject.por.fl_str_mv	Detecção de posicionamento Detecção de posicionamento Aprendizado de máquina Processamento de linguagem natural Comentários políticos
topic	Detecção de posicionamento Detecção de posicionamento Aprendizado de máquina Processamento de linguagem natural Comentários políticos Stance detection Machine learning Natural language processing Political comments CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
dc.subject.eng.fl_str_mv	Stance detection Machine learning Natural language processing Political comments
dc.subject.cnpq.fl_str_mv	CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	Background: Comments on political projects on the internet reflect the aspirations of a significant portion of the population. The automatic stance detection of these comments regarding specific topics can help better understand public opinion. This study aims to develop a computational model with supervised learning capable of estimating the stance of comments on legislative propositions, considering the challenge of diversity and the constant emergence of new bills. Method: For the domain studied, a specific corpus was constructed by collecting comments from surveys available on the Chamber of Deputies website. The experiments included the evaluation of classic machine learning models, such as Logistic Regression, Naive Bayes, Support Vector Machine, Random Forest, and Multilayer Perceptron, in addition to the fine-tuning of BERT language models. Automatic data annotation was also performed using the zero-shot approach based on prompts from the generative GPT-3.5 model, aiming to overcome the difficulties related to human annotation and the scarcity of annotated data, generating approximately three times the size of the manually annotated corpus. Results: The results indicate that the adjusted BERTimbau model surpassed the classic approaches, achieving an average F1- score of 70.4% on unseen topics. Moreover, the application of automatically annotated data in the initial stage of BERTimbau fine-tuning resulted in performance improvement, reaching an F1-score of 73.3%. The results present deep learning models as options with positive performance for the task under the conditions of this domain. Conclusion: It was observed that the ability to generate contextualized representations, along with the number of topics and comments trained, can directly interfere with performance. This makes automatic annotation and the exploration of topic diversity with Transformer architectures, promising approaches for the task
publishDate	2024
dc.date.accessioned.fl_str_mv	2024-09-16T18:07:03Z
dc.date.available.fl_str_mv	2024-09-16T18:07:03Z
dc.date.issued.fl_str_mv	2024-03-22
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	MAIA, D. F. Detecção de posicionamento do cidadão em Projetos de Lei. 2024. 111 f. Dissertação (Mestrado em Ciência da Computação) - Instituto de Informática, Universidade Federal de Goiás, Goiânia, 2024.
dc.identifier.uri.fl_str_mv	http://repositorio.bc.ufg.br/tede/handle/tede/13338
identifier_str_mv	MAIA, D. F. Detecção de posicionamento do cidadão em Projetos de Lei. 2024. 111 f. Dissertação (Mestrado em Ciência da Computação) - Instituto de Informática, Universidade Federal de Goiás, Goiânia, 2024.
url	http://repositorio.bc.ufg.br/tede/handle/tede/13338
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	Attribution-NonCommercial-NoDerivatives 4.0 International info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Attribution-NonCommercial-NoDerivatives 4.0 International
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade Federal de Goiás
dc.publisher.program.fl_str_mv	Programa de Pós-graduação em Ciência da Computação (INF)
dc.publisher.initials.fl_str_mv	UFG
dc.publisher.country.fl_str_mv	Brasil
dc.publisher.department.fl_str_mv	Instituto de Informática - INF (RMG)
publisher.none.fl_str_mv	Universidade Federal de Goiás
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UFG instname:Universidade Federal de Goiás (UFG) instacron:UFG
instname_str	Universidade Federal de Goiás (UFG)
instacron_str	UFG
institution	UFG
reponame_str	Repositório Institucional da UFG
collection	Repositório Institucional da UFG
bitstream.url.fl_str_mv	http://repositorio.bc.ufg.br/tede/bitstreams/92469425-f645-48df-975d-1a3d6574aaf2/download http://repositorio.bc.ufg.br/tede/bitstreams/83338a10-b077-4ba4-a18c-aa97685330b6/download http://repositorio.bc.ufg.br/tede/bitstreams/ea3da0e3-c34d-4984-aad8-31fbfa62385d/download
bitstream.checksum.fl_str_mv	8a4605be74aa9ea9d79846c1fba20a33 4460e5956bc1d1639be9ae6146a50347 126fd9491f6b1d8b63281da0713addd5
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositório Institucional da UFG - Universidade Federal de Goiás (UFG)
repository.mail.fl_str_mv	grt.bc@ufg.br
_version_	1861293819210760192

Detecção de posicionamento do cidadão em Projetos de Lei

Registros relacionados