Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos

Detalhes bibliográficos
Ano de defesa: 2008
Autor(a) principal: Guilherme Henrique Trielli Ferreira
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Minas Gerais
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://hdl.handle.net/1843/RVMR-7PVQPU
Resumo: Extracting usable knowledge from large amounts of data has become one of the main challenges to a variety of fields, such as scientific, industrial or governmental areas. This task requires the data to be represented in a way that not only is the relational information captured, but that it also allows an effective and efficient mining of thesedata and the understanding of the resulting knowledge. In most of the cases, however, the data are modeled as graphs that arent able to represent multiple relations. This restriction may cause a flaw in the process of matching the real data with the model constructed, and, as a consequence, essential information of the real application is lost.Therefore, this work discuss data mining based on relations where, in contrast with traditional techniques, we propose an innovative heuristic of multigraph mining capable of dealing with multiple and indirect relations in the data, using these relations to identify correlated groups. We constructed a theoretical base on which many reallifeapplications can be modeled, so that they preserve important relations that exist in the data and are ignored by other models. We applied our new technique in a real scenario of co-authorship networks, in whichwe intend to group and classify scientific conferences based on authorship affinities. In order to do that, we modeled the data of these networks as multigraph sets, and then we use them to find groups of conferences that are correlated. If these groups can befound in, at least, a certain number of different parts of the multigraph, they will be considered as belonging to the same area. In spite of the fact that the problem we dealt with is NP-Complete and that there is a quite variety in the computational cost of the heuristic, experimental results show that our technique is effective in identifying different areas, even when the data is sparse.
id UFMG_c186761bf8a4de59ce7457a6ea3fd560
oai_identifier_str oai:repositorio.ufmg.br:1843/RVMR-7PVQPU
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretosComputaçãoRecuperação de informaçãomultigrafosredes de co-autoriamineração de dadosagrupamento de dadosExtracting usable knowledge from large amounts of data has become one of the main challenges to a variety of fields, such as scientific, industrial or governmental areas. This task requires the data to be represented in a way that not only is the relational information captured, but that it also allows an effective and efficient mining of thesedata and the understanding of the resulting knowledge. In most of the cases, however, the data are modeled as graphs that arent able to represent multiple relations. This restriction may cause a flaw in the process of matching the real data with the model constructed, and, as a consequence, essential information of the real application is lost.Therefore, this work discuss data mining based on relations where, in contrast with traditional techniques, we propose an innovative heuristic of multigraph mining capable of dealing with multiple and indirect relations in the data, using these relations to identify correlated groups. We constructed a theoretical base on which many reallifeapplications can be modeled, so that they preserve important relations that exist in the data and are ignored by other models. We applied our new technique in a real scenario of co-authorship networks, in whichwe intend to group and classify scientific conferences based on authorship affinities. In order to do that, we modeled the data of these networks as multigraph sets, and then we use them to find groups of conferences that are correlated. If these groups can befound in, at least, a certain number of different parts of the multigraph, they will be considered as belonging to the same area. In spite of the fact that the problem we dealt with is NP-Complete and that there is a quite variety in the computational cost of the heuristic, experimental results show that our technique is effective in identifying different areas, even when the data is sparse.Universidade Federal de Minas Gerais2019-08-13T17:17:09Z2025-09-09T01:07:56Z2019-08-13T17:17:09Z2008-12-12info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/1843/RVMR-7PVQPUGuilherme Henrique Trielli Ferreirainfo:eu-repo/semantics/openAccessporreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMG2025-09-09T01:07:56Zoai:repositorio.ufmg.br:1843/RVMR-7PVQPURepositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2025-09-09T01:07:56Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
title Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
spellingShingle Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
Guilherme Henrique Trielli Ferreira
Computação
Recuperação de informação
multigrafos
redes de co-autoria
mineração de dados
agrupamento de dados
title_short Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
title_full Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
title_fullStr Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
title_full_unstemmed Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
title_sort Uma heuristica para o problema de classificação de classificação de conferências explorando relacionamentos múltiplos e indiretos
author Guilherme Henrique Trielli Ferreira
author_facet Guilherme Henrique Trielli Ferreira
author_role author
dc.contributor.author.fl_str_mv Guilherme Henrique Trielli Ferreira
dc.subject.por.fl_str_mv Computação
Recuperação de informação
multigrafos
redes de co-autoria
mineração de dados
agrupamento de dados
topic Computação
Recuperação de informação
multigrafos
redes de co-autoria
mineração de dados
agrupamento de dados
description Extracting usable knowledge from large amounts of data has become one of the main challenges to a variety of fields, such as scientific, industrial or governmental areas. This task requires the data to be represented in a way that not only is the relational information captured, but that it also allows an effective and efficient mining of thesedata and the understanding of the resulting knowledge. In most of the cases, however, the data are modeled as graphs that arent able to represent multiple relations. This restriction may cause a flaw in the process of matching the real data with the model constructed, and, as a consequence, essential information of the real application is lost.Therefore, this work discuss data mining based on relations where, in contrast with traditional techniques, we propose an innovative heuristic of multigraph mining capable of dealing with multiple and indirect relations in the data, using these relations to identify correlated groups. We constructed a theoretical base on which many reallifeapplications can be modeled, so that they preserve important relations that exist in the data and are ignored by other models. We applied our new technique in a real scenario of co-authorship networks, in whichwe intend to group and classify scientific conferences based on authorship affinities. In order to do that, we modeled the data of these networks as multigraph sets, and then we use them to find groups of conferences that are correlated. If these groups can befound in, at least, a certain number of different parts of the multigraph, they will be considered as belonging to the same area. In spite of the fact that the problem we dealt with is NP-Complete and that there is a quite variety in the computational cost of the heuristic, experimental results show that our technique is effective in identifying different areas, even when the data is sparse.
publishDate 2008
dc.date.none.fl_str_mv 2008-12-12
2019-08-13T17:17:09Z
2019-08-13T17:17:09Z
2025-09-09T01:07:56Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/1843/RVMR-7PVQPU
url https://hdl.handle.net/1843/RVMR-7PVQPU
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv repositorio@ufmg.br
_version_ 1856414064465412096