Modelo de análise de sentimentos com base na estrutura linguística da sentença

Detalhes bibliográficos
Ano de defesa: 2021
Autor(a) principal: Silva, Arnoldo Nunes da
Orientador(a): Souza, José Neuman
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Não Informado pela instituição
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: http://www.repositorio.ufc.br/handle/riufc/58345
Resumo: Sentiment Analysis (SA) is an area of Natural Language Processing that leads to detect the presence of positive, negative or neutral polarity in a text. SA became an element of greater interest due to the increase of data generation from the internet, and the efforts required to process such volume of data. Due to this demand, there is an endeavour to achieve precise results that will allow the sentiment inserted in the text to be automatically interpreted . Several methods can be applied in this task, however, in this thesis the aspects and challenges in exploring the linguistic structure of the sentence stand out, for which the solutions are directed to rules with the properties of the grammar that describe the natural language. It is worth noting that studies of solutions based on rules are also motivated by excluding costs that involve data training. A review of the state of the art, shows that there are advances in studies of formal linguistics that contribute to computational linguistics, as rules of a syntactic description of Brazilian Portuguese already available in literature, and have not been explored for sentiment analysis. In particular, works with restricted models of rules and the use of parsers were detected to define the parts of speech, sentence structure, and dependency relations. However, no solutions were found involving a grammar constructed to describe a natural language incorporated into a parser that analyzes the sentence structure specifically characterized by sentiment. The main result obtained from this thesis was a new model of sentiment analysis based on an expandable regular grammar defined by rules of semantic composition. Thus, a parser was developed that identifies sentence structures characterized or not with positive or negative polarity. To meet this solution, a set of relations of sentiment between grammatical categories were studied and developed based on the formal description of the sentence structure. A prototype was implemented to test the application of the model in corpora sentences and later for a comparative evaluation of the results, which presented rates at the same levels obtained by other methods.
id UFC-7_f526301c408391f2274382d08cf7c2ed
oai_identifier_str oai:repositorio.ufc.br:riufc/58345
network_acronym_str UFC-7
network_name_str Repositório Institucional da Universidade Federal do Ceará (UFC)
repository_id_str
spelling Silva, Arnoldo Nunes daSouza, Osvaldo deSouza, José Neuman2021-05-13T10:49:37Z2021-05-13T10:49:37Z2021SILVA, Arnoldo Nunes da. Modelo de análise de sentimentos com base na estrutura linguística da sentença. 2021. 97 f. Tese (Doutorado em Ciência da Computação) - Universidade Federal do Ceará, Fortaleza, 2021.http://www.repositorio.ufc.br/handle/riufc/58345Sentiment Analysis (SA) is an area of Natural Language Processing that leads to detect the presence of positive, negative or neutral polarity in a text. SA became an element of greater interest due to the increase of data generation from the internet, and the efforts required to process such volume of data. Due to this demand, there is an endeavour to achieve precise results that will allow the sentiment inserted in the text to be automatically interpreted . Several methods can be applied in this task, however, in this thesis the aspects and challenges in exploring the linguistic structure of the sentence stand out, for which the solutions are directed to rules with the properties of the grammar that describe the natural language. It is worth noting that studies of solutions based on rules are also motivated by excluding costs that involve data training. A review of the state of the art, shows that there are advances in studies of formal linguistics that contribute to computational linguistics, as rules of a syntactic description of Brazilian Portuguese already available in literature, and have not been explored for sentiment analysis. In particular, works with restricted models of rules and the use of parsers were detected to define the parts of speech, sentence structure, and dependency relations. However, no solutions were found involving a grammar constructed to describe a natural language incorporated into a parser that analyzes the sentence structure specifically characterized by sentiment. The main result obtained from this thesis was a new model of sentiment analysis based on an expandable regular grammar defined by rules of semantic composition. Thus, a parser was developed that identifies sentence structures characterized or not with positive or negative polarity. To meet this solution, a set of relations of sentiment between grammatical categories were studied and developed based on the formal description of the sentence structure. A prototype was implemented to test the application of the model in corpora sentences and later for a comparative evaluation of the results, which presented rates at the same levels obtained by other methods.A Análise de Sentimentos (SA) é uma área do Processamento de Linguagem Natural que permite detectar a presença de polaridade positiva, negativa ou neutra em um texto. O SA tornou-se um elemento de maior interesse devido ao aumento da geração de dados a partir da internet e aos esforços necessários para processar esse volume de dados. Diante dessa demanda, busca-se obter resultados precisos que permitam a interpretação automática do sentimento inserido no texto. Vários métodos podem ser aplicados nesta tarefa, porém, nesta tese se destacam os aspectos e desafios em explorar a estrutura linguística da frase, para os quais as soluções são direcionadas a regras com as propriedades da gramática que descrevem a linguagem natural. Vale ressaltar que estudos de soluções baseadas em regras também são motivados pela exclusão de custos que envolvem treinamento de dados. Uma revisão do estado da arte, mostra que existem avanços nos estudos da linguística formal que contribuem para a linguística computacional, como regras de uma descrição sintática do português brasileiro já disponível na literatura, e não foram explorados para análise de sentimento. Em particular, trabalhos com modelos restritos de regras e o uso de parsers foram detectados para definir as classes gramaticais, a estrutura da sentença e as relações de dependência. No entanto, nenhuma solução foi encontrada envolvendo uma gramática construída para descrever uma linguagem natural incorporada a um parser que analisa a estrutura da frase caracterizada especificamente pelo sentimento. O principal resultado obtido com esta tese foi um novo modelo de análise de sentimento baseado em uma gramática regular expansível definida por regras de composição semântica. Assim, foi desenvolvido um parser que identifica estruturas de sentenças caracterizadas ou não com polaridade positiva ou negativa. Para atender a essa solução, um conjunto de relações de sentimento entre categorias gramaticais foi estudado e desenvolvido a partir da descrição formal da estrutura da frase. Foi implementado um protótipo para testar a aplicação do modelo em corpora de sentenças e posteriormente para uma avaliação comparativa dos resultados, que apresentaram índices nos mesmos níveis obtidos por outros métodos.Análise de SentimentosProcessamento de Linguagem NaturalLinguística ComputacionalModelo de análise de sentimentos com base na estrutura linguística da sentençaModel of sentiment analysis based on the linguistic structure of the sentenceinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisporreponame:Repositório Institucional da Universidade Federal do Ceará (UFC)instname:Universidade Federal do Ceará (UFC)instacron:UFCinfo:eu-repo/semantics/openAccessLICENSElicense.txtlicense.txttext/plain; charset=utf-82125http://repositorio.ufc.br/bitstream/riufc/58345/4/license.txtce2f77d9db6511060b9277b356f86c2dMD54ORIGINAL2021_tese_ansilva.pdf2021_tese_ansilva.pdfapplication/pdf3987695http://repositorio.ufc.br/bitstream/riufc/58345/3/2021_tese_ansilva.pdf815fc86bf047112530169e8bcad618c4MD53riufc/583452021-05-13 07:49:37.623oai:repositorio.ufc.br:riufc/58345TElDRU7Dh0EgREUgQVJNQVpFTkFNRU5UTyBFIERJU1RSSUJVScOHw4NPIE7Dg08tRVhDTFVTSVZBIA0KDQpBbyBjb25jb3JkYXIgY29tIGVzdGEgbGljZW7Dp2EsIHZvY8OqKHMpIGF1dG9yKGVzKSBvdSB0aXR1bGFyKGVzKSBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGEgb2JyYSBhcXVpIGRlc2NyaXRhIGNvbmNlZGUobSkgw6AgVW5pdmVyc2lkYWRlIEZlZGVyYWwgZG8gQ2VhcsOhLCBnZXN0b3JhIGRvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGQyAtIFJJL1VGQywgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsIGNvbnZlcnRlciAoY29tbyBkZWZpbmlkbyBhYmFpeG8pIGUvb3UgZGlzdHJpYnVpciBvIGRvY3VtZW50byBkZXBvc2l0YWRvIGVtIGZvcm1hdG8gaW1wcmVzc28sIGVsZXRyw7RuaWNvIG91IGVtIHF1YWxxdWVyIG91dHJvIG1laW8uIFZvY8OqIGNvbmNvcmRhKG0pIHF1ZSBhIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRvIENlYXLDoSwgZ2VzdG9yYSBkbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRkMgLSBSSS9VRkMsIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCBjb252ZXJ0ZXIgbyBhcnF1aXZvIGRlcG9zaXRhZG8gYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gY29tIGZpbnMgZGUgcHJlc2VydmHDp8Ojby4gVm9jw6oocykgdGFtYsOpbSBjb25jb3JkYShtKSBxdWUgYSBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkbyBDZWFyw6EsIGdlc3RvcmEgZG8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZDIC0gUkkvVUZDLCBwb2RlIG1hbnRlciBtYWlzIGRlIHVtYSBjw7NwaWEgZGVzdGUgZGVww7NzaXRvIHBhcmEgZmlucyBkZSBzZWd1cmFuw6dhLCBiYWNrLXVwIGUvb3UgcHJlc2VydmHDp8Ojby4gVm9jw6ogZGVjbGFyYSBxdWUgYSBhcHJlc2VudGHDp8OjbyBkbyBzZXUgdHJhYmFsaG8gw6kgb3JpZ2luYWwgZSBxdWUgdm9jw6oocykgcG9kZShtKSBjb25jZWRlciBvcyBkaXJlaXRvcyBjb250aWRvcyBuZXN0YSBsaWNlbsOnYS4gVm9jw6ogdGFtYsOpbSBkZWNsYXJhKG0pIHF1ZSBvIGVudmlvIMOpIGRlIHNldSBjb25oZWNpbWVudG8gZSBuw6NvIGluZnJpbmdlIG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRlIG91dHJhIHBlc3NvYSBvdSBpbnN0aXR1acOnw6NvLiBDYXNvIG8gZG9jdW1lbnRvIGEgc2VyIGRlcG9zaXRhZG8gY29udGVuaGEgbWF0ZXJpYWwgcGFyYSBvIHF1YWwgdm9jw6oocykgbsOjbyBkZXTDqW0gYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGRlIGF1dG9yYWlzLCB2b2PDqihzKSBkZWNsYXJhKG0pIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gdGl0dWxhciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGUgY29uY2VkZXIgw6AgVW5pdmVyc2lkYWRlIEZlZGVyYWwgZG8gQ2VhcsOhLCBnZXN0b3JhIGRvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGQyAtIFJJL1VGQywgb3MgZGlyZWl0b3MgcmVxdWVyaWRvcyBwb3IgZXN0YSBsaWNlbsOnYSBlIHF1ZSBvcyBtYXRlcmlhaXMgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zLCBlc3TDo28gZGV2aWRhbWVudGUgaWRlbnRpZmljYWRvcyBlIHJlY29uaGVjaWRvcyBubyB0ZXh0byBvdSBjb250ZcO6ZG8gZGEgYXByZXNlbnRhw6fDo28uDQoNCkNBU08gTyBUUkFCQUxITyBERVBPU0lUQURPIFRFTkhBIFNJRE8gRklOQU5DSUFETyBPVSBBUE9JQURPIFBPUiBVTSDDk1JHw4NPLCBRVUUgTsODTyBBIElOU1RJVFVJw4fDg08gREVTVEUgUkVQT1NJVMOTUklPOiBWT0PDiiBERUNMQVJBIFRFUiBDVU1QUklETyBUT0RPUyBPUyBESVJFSVRPUyBERSBSRVZJU8ODTyBFIFFVQUlTUVVFUiBPVVRSQVMgT0JSSUdBw4fDlUVTIFJFUVVFUklEQVMgUEVMTyBDT05UUkFUTyBPVSBBQ09SRE8uIA0KDQpPIHJlcG9zaXTDs3JpbyBpZGVudGlmaWNhcsOhIGNsYXJhbWVudGUgbyBzZXUocykgbm9tZShzKSBjb21vIGF1dG9yKGVzKSBvdSB0aXR1bGFyKGVzKSBkbyBkaXJlaXRvIGRlIGF1dG9yKGVzKSBkbyBkb2N1bWVudG8gc3VibWV0aWRvIGUgZGVjbGFyYSBxdWUgbsOjbyBmYXLDoSBxdWFscXVlciBhbHRlcmHDp8OjbyBhbMOpbSBkYXMgcGVybWl0aWRhcyBwb3IgZXN0YSBsaWNlbsOnYS4NCg==Repositório InstitucionalPUBhttp://www.repositorio.ufc.br/ri-oai/requestbu@ufc.br || repositorio@ufc.bropendoar:2021-05-13T10:49:37Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)false
dc.title.pt_BR.fl_str_mv Modelo de análise de sentimentos com base na estrutura linguística da sentença
dc.title.en.pt_BR.fl_str_mv Model of sentiment analysis based on the linguistic structure of the sentence
title Modelo de análise de sentimentos com base na estrutura linguística da sentença
spellingShingle Modelo de análise de sentimentos com base na estrutura linguística da sentença
Silva, Arnoldo Nunes da
Análise de Sentimentos
Processamento de Linguagem Natural
Linguística Computacional
title_short Modelo de análise de sentimentos com base na estrutura linguística da sentença
title_full Modelo de análise de sentimentos com base na estrutura linguística da sentença
title_fullStr Modelo de análise de sentimentos com base na estrutura linguística da sentença
title_full_unstemmed Modelo de análise de sentimentos com base na estrutura linguística da sentença
title_sort Modelo de análise de sentimentos com base na estrutura linguística da sentença
author Silva, Arnoldo Nunes da
author_facet Silva, Arnoldo Nunes da
author_role author
dc.contributor.co-advisor.none.fl_str_mv Souza, Osvaldo de
dc.contributor.author.fl_str_mv Silva, Arnoldo Nunes da
dc.contributor.advisor1.fl_str_mv Souza, José Neuman
contributor_str_mv Souza, José Neuman
dc.subject.por.fl_str_mv Análise de Sentimentos
Processamento de Linguagem Natural
Linguística Computacional
topic Análise de Sentimentos
Processamento de Linguagem Natural
Linguística Computacional
description Sentiment Analysis (SA) is an area of Natural Language Processing that leads to detect the presence of positive, negative or neutral polarity in a text. SA became an element of greater interest due to the increase of data generation from the internet, and the efforts required to process such volume of data. Due to this demand, there is an endeavour to achieve precise results that will allow the sentiment inserted in the text to be automatically interpreted . Several methods can be applied in this task, however, in this thesis the aspects and challenges in exploring the linguistic structure of the sentence stand out, for which the solutions are directed to rules with the properties of the grammar that describe the natural language. It is worth noting that studies of solutions based on rules are also motivated by excluding costs that involve data training. A review of the state of the art, shows that there are advances in studies of formal linguistics that contribute to computational linguistics, as rules of a syntactic description of Brazilian Portuguese already available in literature, and have not been explored for sentiment analysis. In particular, works with restricted models of rules and the use of parsers were detected to define the parts of speech, sentence structure, and dependency relations. However, no solutions were found involving a grammar constructed to describe a natural language incorporated into a parser that analyzes the sentence structure specifically characterized by sentiment. The main result obtained from this thesis was a new model of sentiment analysis based on an expandable regular grammar defined by rules of semantic composition. Thus, a parser was developed that identifies sentence structures characterized or not with positive or negative polarity. To meet this solution, a set of relations of sentiment between grammatical categories were studied and developed based on the formal description of the sentence structure. A prototype was implemented to test the application of the model in corpora sentences and later for a comparative evaluation of the results, which presented rates at the same levels obtained by other methods.
publishDate 2021
dc.date.accessioned.fl_str_mv 2021-05-13T10:49:37Z
dc.date.available.fl_str_mv 2021-05-13T10:49:37Z
dc.date.issued.fl_str_mv 2021
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv SILVA, Arnoldo Nunes da. Modelo de análise de sentimentos com base na estrutura linguística da sentença. 2021. 97 f. Tese (Doutorado em Ciência da Computação) - Universidade Federal do Ceará, Fortaleza, 2021.
dc.identifier.uri.fl_str_mv http://www.repositorio.ufc.br/handle/riufc/58345
identifier_str_mv SILVA, Arnoldo Nunes da. Modelo de análise de sentimentos com base na estrutura linguística da sentença. 2021. 97 f. Tese (Doutorado em Ciência da Computação) - Universidade Federal do Ceará, Fortaleza, 2021.
url http://www.repositorio.ufc.br/handle/riufc/58345
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da Universidade Federal do Ceará (UFC)
instname:Universidade Federal do Ceará (UFC)
instacron:UFC
instname_str Universidade Federal do Ceará (UFC)
instacron_str UFC
institution UFC
reponame_str Repositório Institucional da Universidade Federal do Ceará (UFC)
collection Repositório Institucional da Universidade Federal do Ceará (UFC)
bitstream.url.fl_str_mv http://repositorio.ufc.br/bitstream/riufc/58345/4/license.txt
http://repositorio.ufc.br/bitstream/riufc/58345/3/2021_tese_ansilva.pdf
bitstream.checksum.fl_str_mv ce2f77d9db6511060b9277b356f86c2d
815fc86bf047112530169e8bcad618c4
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)
repository.mail.fl_str_mv bu@ufc.br || repositorio@ufc.br
_version_ 1847793375625871360