Identificação de padrões em textos de mídias sociais utilizando redes neurais e visualização de dados

Sargiani, Vagner

Bibliographic details
Year of defense:	2018
Main author:	Sargiani, Vagner
Advisor:	Silva, Leandro Augusto da
Examination committee:	Notargiacomo, Pollyana Coelho da Silva; Barcelos, Thiago Schumacher
Document type:	Master's thesis
Access type:	Open access
Language:	Portuguese (por)
Defending institution:	Universidade Presbiteriana Mackenzie
Graduate program:	Engenharia Elétrica (Electrical Engineering)
Department:	Faculdade de Computação e Informática (FCI)
Country:	Brazil
Keywords (Portuguese):	mineração de texto (text mining); mapas auto organizáveis (self organizing maps); visualização (visualization); semântica (semantic)
CNPq knowledge area:	CNPQ::CIENCIAS EXATAS E DA TERRA; CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Access link:	http://dspace.mackenzie.br/handle/10899/24472
Abstract:	A large volume of textual data is currently being generated, and part of it comes from social media, where people connect and exchange information and experiences. These data contain valuable implicit knowledge, which can be extracted and analyzed according to the selected medium and the type of knowledge sought. The objective of this work is to demonstrate how to use data mining resources, analytical tools, and Self-Organizing Map (SOM) neural networks to analyze textual data and generate knowledge. Two approaches are taken: knowledge for the educational area (with data from Question and Answer (Q&A) sites) and trend identification (with posts from the microblog Twitter). Both sources are similar in that their text is unstructured. A term matrix, generated with text mining techniques from the unstructured posts, was used to train a SOM network, and the trained network made it possible to generate visualizations that support semantic analysis of the grouped terms and questions and help identify the desired knowledge. The results demonstrate that questions about similar subjects can be grouped by the similarity of their terms, and that these groupings can be visualized as word clouds, enabling semantic analysis of the grouped questions.
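
The pipeline outlined in the abstract (term matrix, SOM training, per-group term counts for word clouds) can be illustrated with a minimal sketch. This is not the implementation used in the thesis: the sample questions, the whitespace tokenizer, the 2x2 map, the hyperparameters, and every function name below are assumptions chosen only for illustration, and the SOM is a plain-Python toy with a Gaussian neighbourhood.

```python
import math
import random
from collections import Counter


def term_matrix(docs):
    """Build a bag-of-words count matrix: one row per document, one column per term."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        rows.append([float(counts[t]) for t in vocab])
    return vocab, rows


def best_matching_unit(weights, x):
    """Return the index of the map unit whose weight vector is closest to x."""
    dists = [sum((w - v) ** 2 for w, v in zip(unit, x)) for unit in weights]
    return dists.index(min(dists))


def train_som(rows, grid=(2, 2), epochs=50, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small rectangular SOM (hypothetical hyperparameters)."""
    rng = random.Random(seed)
    coords = [(i, j) for i in range(grid[0]) for j in range(grid[1])]
    dim = len(rows[0])
    weights = [[rng.random() for _ in range(dim)] for _ in coords]
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                # linearly decaying learning rate
        sigma = sigma0 * (1.0 - frac) + 0.01   # shrinking neighbourhood radius
        for x in rng.sample(rows, len(rows)):  # visit documents in random order
            bmu = best_matching_unit(weights, x)
            for k, unit in enumerate(weights):
                d2 = ((coords[k][0] - coords[bmu][0]) ** 2
                      + (coords[k][1] - coords[bmu][1]) ** 2)
                h = math.exp(-d2 / (2.0 * sigma ** 2))  # Gaussian neighbourhood
                for t in range(dim):
                    unit[t] += lr * h * (x[t] - unit[t])
    return weights


def group_terms(vocab, rows, weights):
    """Assign each document to its unit and sum term counts per unit."""
    groups = {}
    for x in rows:
        unit = best_matching_unit(weights, x)
        counter = groups.setdefault(unit, Counter())
        counter.update({t: int(c) for t, c in zip(vocab, x) if c > 0})
    return groups


# Invented sample questions, standing in for Q&A posts.
docs = [
    "how to sort a list in python",
    "python list sort descending order",
    "what is a neural network",
    "training a neural network model",
]
vocab, rows = term_matrix(docs)
weights = train_som(rows)
groups = group_terms(vocab, rows, weights)
for unit, counts in sorted(groups.items()):
    print(unit, counts.most_common(3))
```

The per-unit counters are the kind of input a word-cloud renderer would take, one cloud per map unit. Real data would also need stop-word removal and term weighting, which this sketch leaves out.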

Item metadata

id	UPM_7f14a1c1dbb1a03580d234a0da5455e0
oai_identifier_str	oai:dspace.mackenzie.br:10899/24472
network_acronym_str	UPM
network_name_str	Biblioteca Digital de Teses e Dissertações do Mackenzie
funding	Coordenação de Aperfeiçoamento de Pessoal de Nível Superior; Fundo Mackenzie de Pesquisa
keywords (English)	text mining; self organizing maps; visualization; semantic
full text (PDF)	http://tede.mackenzie.br/jspui/bitstream/tede/3565/5/VAGNER%20SARGIANI.pdf
thumbnail	http://tede.mackenzie.br/jspui/retrieve/16422/VAGNER%20SARGIANI.pdf.jpg
dc.title.por.fl_str_mv	Identificação de padrões em textos de mídias sociais utilizando redes neurais e visualização de dados
author	Sargiani, Vagner
author_facet	Sargiani, Vagner
author_role	author
dc.contributor.advisor1.fl_str_mv	Silva, Leandro Augusto da
dc.contributor.advisor1Lattes.fl_str_mv	http://lattes.cnpq.br/1396385111251741
dc.contributor.referee1.fl_str_mv	Notargiacomo, Pollyana Coelho da Silva
dc.contributor.referee1Lattes.fl_str_mv	http://lattes.cnpq.br/5131975026612008
dc.contributor.referee2.fl_str_mv	Barcelos, Thiago Schumacher
dc.contributor.referee2Lattes.fl_str_mv	http://lattes.cnpq.br/0179728954543082
dc.contributor.authorLattes.fl_str_mv	http://lattes.cnpq.br/9363303337287168
dc.contributor.author.fl_str_mv	Sargiani, Vagner
contributor_str_mv	Silva, Leandro Augusto da; Notargiacomo, Pollyana Coelho da Silva; Barcelos, Thiago Schumacher
dc.subject.por.fl_str_mv	mineração de texto; mapas auto organizáveis; visualização; semântica
dc.subject.cnpq.fl_str_mv	CNPQ::CIENCIAS EXATAS E DA TERRA; CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
publishDate	2018
dc.date.accessioned.fl_str_mv	2018-04-28T17:31:19Z 2020-05-28T18:08:53Z
dc.date.issued.fl_str_mv	2018-02-05
dc.date.available.fl_str_mv	2020-05-28T18:08:53Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.citation.fl_str_mv	SARGIANI, Vagner. Identificação de padrões em textos de mídias sociais utilizando redes neurais e visualização de dados. 2018. 64 f. Dissertação (Engenharia Elétrica) - Universidade Presbiteriana Mackenzie, São Paulo.
dc.identifier.uri.fl_str_mv	http://dspace.mackenzie.br/handle/10899/24472
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	http://creativecommons.org/licenses/by-nc-nd/4.0/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-nd/4.0/
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Presbiteriana Mackenzie
dc.publisher.program.fl_str_mv	Engenharia Elétrica
dc.publisher.initials.fl_str_mv	UPM
dc.publisher.country.fl_str_mv	Brasil
dc.publisher.department.fl_str_mv	Faculdade de Computação e Informática (FCI)
publisher.none.fl_str_mv	Universidade Presbiteriana Mackenzie
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações do Mackenzie instname:Universidade Presbiteriana Mackenzie (MACKENZIE) instacron:MACKENZIE
instname_str	Universidade Presbiteriana Mackenzie (MACKENZIE)
instacron_str	MACKENZIE
institution	MACKENZIE
reponame_str	Biblioteca Digital de Teses e Dissertações do Mackenzie
collection	Biblioteca Digital de Teses e Dissertações do Mackenzie
_version_	1757174472674115584
