A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour

Detalhes bibliográficos
Ano de defesa: 2021
Autor(a) principal: MATTOS, Juliana Barcellos
Orientador(a): VIMIEIRO, Renato
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: eng
Instituição de defesa: Universidade Federal de Pernambuco
Programa de Pós-Graduação: Programa de Pos Graduacao em Ciencia da Computacao
Departamento: Não Informado pela instituição
País: Brasil
Palavras-chave em Português:
Link de acesso: https://repositorio.ufpe.br/handle/123456789/44957
Resumo: A variety of works in the literature strive to uncover the factors associated with survival behaviour. However, the computational tools to provide such information are global models designed to predict if or when a (survival) event will occur. When addressing the problem of explaining differences in survival behaviour, those approaches rely on (assumptions of) predictive features followed by risk stratification. In other words, they lack the ability to discover local exceptionalities in the data and provide new information on factors related to survival. In this work, we aim at providing a computational tool to identify the different (unusual) survival responses that may occur in a population of individuals and provide straightforward information about the circumstances related to such responses. We approach such a problem from the perspective of supervised descriptive pattern mining to discover local patterns associated with different survival behaviours. Hence, we introduce an Exceptional Model Mining (EMM) framework to provide straightforward characterisations of subgroups presenting unusual survival models, given by the Kaplan-Meier estimates. In contrast to the greedy search heuristics prevalent among EMM approaches, we employ stochastic optimisation and introduce the first approach in the literature to explore the Ant-Colony Optimisation (ACO) meta-heuristics for the subgroup search. Thus, we tackle the problem of subgroup redundancy to provide a set of exceptional subgroups that are diverse in their descriptions, coverages and survival models. We conducted experiments on fourteen real-world data sets to assess the performance of our approach. In the results, we show that the framework presented is capable of discovering representative patterns with accurate unusual models and straightforward representations. Moreover, the discovered subgroups potentially capture survival behaviours existent in the data. The approach successfully tackles the problem of subgroup redundancy, providing a set of diverse (unique) exceptional (survival) subgroups. Our framework outperforms the other existent approaches to provide characterisations over unusual survival behaviours regarding the descriptive aspect of its results and diversity of its findings.
id UFPE_df133dbcaefcf16b80008b72e7981b44
oai_identifier_str oai:repositorio.ufpe.br:123456789/44957
network_acronym_str UFPE
network_name_str Repositório Institucional da UFPE
repository_id_str
spelling MATTOS, Juliana Barcelloshttp://lattes.cnpq.br/7907615802587388http://lattes.cnpq.br/5736183954752317http://lattes.cnpq.br/4610098557429398VIMIEIRO, RenatoMATTOS NETO, Paulo Salgado Gomes de2022-07-04T16:04:48Z2022-07-04T16:04:48Z2021-12-10MATTOS, Juliana Barcellos. A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour. 2021. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Pernambuco, Recife, 2021.https://repositorio.ufpe.br/handle/123456789/44957A variety of works in the literature strive to uncover the factors associated with survival behaviour. However, the computational tools to provide such information are global models designed to predict if or when a (survival) event will occur. When addressing the problem of explaining differences in survival behaviour, those approaches rely on (assumptions of) predictive features followed by risk stratification. In other words, they lack the ability to discover local exceptionalities in the data and provide new information on factors related to survival. In this work, we aim at providing a computational tool to identify the different (unusual) survival responses that may occur in a population of individuals and provide straightforward information about the circumstances related to such responses. We approach such a problem from the perspective of supervised descriptive pattern mining to discover local patterns associated with different survival behaviours. Hence, we introduce an Exceptional Model Mining (EMM) framework to provide straightforward characterisations of subgroups presenting unusual survival models, given by the Kaplan-Meier estimates. In contrast to the greedy search heuristics prevalent among EMM approaches, we employ stochastic optimisation and introduce the first approach in the literature to explore the Ant-Colony Optimisation (ACO) meta-heuristics for the subgroup search. Thus, we tackle the problem of subgroup redundancy to provide a set of exceptional subgroups that are diverse in their descriptions, coverages and survival models. We conducted experiments on fourteen real-world data sets to assess the performance of our approach. In the results, we show that the framework presented is capable of discovering representative patterns with accurate unusual models and straightforward representations. Moreover, the discovered subgroups potentially capture survival behaviours existent in the data. The approach successfully tackles the problem of subgroup redundancy, providing a set of diverse (unique) exceptional (survival) subgroups. Our framework outperforms the other existent approaches to provide characterisations over unusual survival behaviours regarding the descriptive aspect of its results and diversity of its findings.CAPESDiversos trabalhos na literatura dedicam-se a descobrir fatores associados a comportamentos de sobrevivência. As ferramentas computacionais utilizadas para tal são modelos globais projetados para estimar se e quando um dado evento de sobrevivência ocorrerá. Em se tratando do problema de explicar diferentes respostas de sobrevivência, as abordagens existentes não são capazes de descobrir excepcionalidades locais nos dados nem prover novos conhecimentos a respeito de fatores associados à sobrevivência, respaldando-se em suposições e a análises estratificadas. Este trabalho tem por objetivo apresentar uma nova ferramenta computacional para identificação e caracterização de diferentes respostas de sobrevivência existentes em uma população de indivíduos. Neste trabalho, o problema enunciado é abordado através da perspectiva da mineração supervisionada de padrões descritivos (em inglês, supervised descriptive pattern mining) com o intuito de descobrir padrões locais associados a diferentes comportamentos de sobrevivência. Para tal, é empregada a técnica de mineração de modelos excepcionais (do inglês, Exceptional Model Mining) com o objetivo de descrever – de forma simples e concisa – subgrupos que apresentem modelos de sobrevivência (Kaplan-Meier) não usuais. Em contraste às heurísticas ‘gulosas’ prevalentes na literatura de mineração de modelos excepcionais, a abordagem introduzida neste trabalho explora o uso da meta-heurística de otimização Ant-Colony Optimisation na busca por subgrupos. O problema de redundância de padrões também é considerado, objetivando a descoberta de um conjunto de subgrupos que sejam diversos com relação às suas descrições, coberturas e modelos. O desempenho da abordagem apresentada é avaliada em quatorze conjuntos de dados reais. Os resultados mostram que o algoritmo proposto é capaz de descobrir padrões representativos que apresentam modelos precisos e caracterizações de simples compreensão. Adicionalmente, os subgrupos descobertos potencialmente capturam comportamentos de sobrevivência existentes nos dados. A redundância de padrões é abordada de forma bem-sucedida, tal que os resultados retornados apresentam conjuntos de subgrupos que são diversos (únicos) e excepcionais. Quando comparado a outras abordagens existentes na literatura que fornecem caracterizações de comportamentos incomuns de sobrevivência, o algoritmo apresentado se sobressai aos demais tanto em relação ao aspecto descritivo de seus resultados quanto à diversidade de suas descobertas.engUniversidade Federal de PernambucoPrograma de Pos Graduacao em Ciencia da ComputacaoUFPEBrasilAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessInteligência computacionalMineração de modelosA supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviourinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesismestradoreponame:Repositório Institucional da UFPEinstname:Universidade Federal de Pernambuco (UFPE)instacron:UFPEORIGINALDISSERTAÇÃO Juliana Barcellos Mattos.pdfDISSERTAÇÃO Juliana Barcellos Mattos.pdfapplication/pdf1371675https://repositorio.ufpe.br/bitstream/123456789/44957/1/DISSERTA%c3%87%c3%83O%20Juliana%20Barcellos%20Mattos.pdfefc2197847f84611bb93b92f8c7acfabMD51LICENSElicense.txtlicense.txttext/plain; charset=utf-82142https://repositorio.ufpe.br/bitstream/123456789/44957/3/license.txt6928b9260b07fb2755249a5ca9903395MD53TEXTDISSERTAÇÃO Juliana Barcellos Mattos.pdf.txtDISSERTAÇÃO Juliana Barcellos Mattos.pdf.txtExtracted texttext/plain283507https://repositorio.ufpe.br/bitstream/123456789/44957/4/DISSERTA%c3%87%c3%83O%20Juliana%20Barcellos%20Mattos.pdf.txtdea8366722177619ebb9817d7b95e83cMD54THUMBNAILDISSERTAÇÃO Juliana Barcellos Mattos.pdf.jpgDISSERTAÇÃO Juliana Barcellos Mattos.pdf.jpgGenerated Thumbnailimage/jpeg1257https://repositorio.ufpe.br/bitstream/123456789/44957/5/DISSERTA%c3%87%c3%83O%20Juliana%20Barcellos%20Mattos.pdf.jpgddd907414aafe749b081c991efb96144MD55CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufpe.br/bitstream/123456789/44957/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52123456789/449572022-07-05 02:18:47.086oai:repositorio.ufpe.br:123456789/44957VGVybW8gZGUgRGVww7NzaXRvIExlZ2FsIGUgQXV0b3JpemHDp8OjbyBwYXJhIFB1YmxpY2HDp8OjbyBkZSBEb2N1bWVudG9zIG5vIFJlcG9zaXTDs3JpbyBEaWdpdGFsIGRhIFVGUEUKIAoKRGVjbGFybyBlc3RhciBjaWVudGUgZGUgcXVlIGVzdGUgVGVybW8gZGUgRGVww7NzaXRvIExlZ2FsIGUgQXV0b3JpemHDp8OjbyB0ZW0gbyBvYmpldGl2byBkZSBkaXZ1bGdhw6fDo28gZG9zIGRvY3VtZW50b3MgZGVwb3NpdGFkb3Mgbm8gUmVwb3NpdMOzcmlvIERpZ2l0YWwgZGEgVUZQRSBlIGRlY2xhcm8gcXVlOgoKSSAtICBvIGNvbnRlw7pkbyBkaXNwb25pYmlsaXphZG8gw6kgZGUgcmVzcG9uc2FiaWxpZGFkZSBkZSBzdWEgYXV0b3JpYTsKCklJIC0gbyBjb250ZcO6ZG8gw6kgb3JpZ2luYWwsIGUgc2UgbyB0cmFiYWxobyBlL291IHBhbGF2cmFzIGRlIG91dHJhcyBwZXNzb2FzIGZvcmFtIHV0aWxpemFkb3MsIGVzdGFzIGZvcmFtIGRldmlkYW1lbnRlIHJlY29uaGVjaWRhczsKCklJSSAtIHF1YW5kbyB0cmF0YXItc2UgZGUgVHJhYmFsaG8gZGUgQ29uY2x1c8OjbyBkZSBDdXJzbywgRGlzc2VydGHDp8OjbyBvdSBUZXNlOiBvIGFycXVpdm8gZGVwb3NpdGFkbyBjb3JyZXNwb25kZSDDoCB2ZXJzw6NvIGZpbmFsIGRvIHRyYWJhbGhvOwoKSVYgLSBxdWFuZG8gdHJhdGFyLXNlIGRlIFRyYWJhbGhvIGRlIENvbmNsdXPDo28gZGUgQ3Vyc28sIERpc3NlcnRhw6fDo28gb3UgVGVzZTogZXN0b3UgY2llbnRlIGRlIHF1ZSBhIGFsdGVyYcOnw6NvIGRhIG1vZGFsaWRhZGUgZGUgYWNlc3NvIGFvIGRvY3VtZW50byBhcMOzcyBvIGRlcMOzc2l0byBlIGFudGVzIGRlIGZpbmRhciBvIHBlcsOtb2RvIGRlIGVtYmFyZ28sIHF1YW5kbyBmb3IgZXNjb2xoaWRvIGFjZXNzbyByZXN0cml0bywgc2Vyw6EgcGVybWl0aWRhIG1lZGlhbnRlIHNvbGljaXRhw6fDo28gZG8gKGEpIGF1dG9yIChhKSBhbyBTaXN0ZW1hIEludGVncmFkbyBkZSBCaWJsaW90ZWNhcyBkYSBVRlBFIChTSUIvVUZQRSkuCgogClBhcmEgdHJhYmFsaG9zIGVtIEFjZXNzbyBBYmVydG86CgpOYSBxdWFsaWRhZGUgZGUgdGl0dWxhciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGUgYXV0b3IgcXVlIHJlY2FlbSBzb2JyZSBlc3RlIGRvY3VtZW50bywgZnVuZGFtZW50YWRvIG5hIExlaSBkZSBEaXJlaXRvIEF1dG9yYWwgbm8gOS42MTAsIGRlIDE5IGRlIGZldmVyZWlybyBkZSAxOTk4LCBhcnQuIDI5LCBpbmNpc28gSUlJLCBhdXRvcml6byBhIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRlIFBlcm5hbWJ1Y28gYSBkaXNwb25pYmlsaXphciBncmF0dWl0YW1lbnRlLCBzZW0gcmVzc2FyY2ltZW50byBkb3MgZGlyZWl0b3MgYXV0b3JhaXMsIHBhcmEgZmlucyBkZSBsZWl0dXJhLCBpbXByZXNzw6NvIGUvb3UgZG93bmxvYWQgKGFxdWlzacOnw6NvKSBhdHJhdsOpcyBkbyBzaXRlIGRvIFJlcG9zaXTDs3JpbyBEaWdpdGFsIGRhIFVGUEUgbm8gZW5kZXJlw6dvIGh0dHA6Ly93d3cucmVwb3NpdG9yaW8udWZwZS5iciwgYSBwYXJ0aXIgZGEgZGF0YSBkZSBkZXDDs3NpdG8uCgogClBhcmEgdHJhYmFsaG9zIGVtIEFjZXNzbyBSZXN0cml0bzoKCk5hIHF1YWxpZGFkZSBkZSB0aXR1bGFyIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkZSBhdXRvciBxdWUgcmVjYWVtIHNvYnJlIGVzdGUgZG9jdW1lbnRvLCBmdW5kYW1lbnRhZG8gbmEgTGVpIGRlIERpcmVpdG8gQXV0b3JhbCBubyA5LjYxMCBkZSAxOSBkZSBmZXZlcmVpcm8gZGUgMTk5OCwgYXJ0LiAyOSwgaW5jaXNvIElJSSwgYXV0b3Jpem8gYSBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkZSBQZXJuYW1idWNvIGEgZGlzcG9uaWJpbGl6YXIgZ3JhdHVpdGFtZW50ZSwgc2VtIHJlc3NhcmNpbWVudG8gZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCBwYXJhIGZpbnMgZGUgbGVpdHVyYSwgaW1wcmVzc8OjbyBlL291IGRvd25sb2FkIChhcXVpc2nDp8OjbykgYXRyYXbDqXMgZG8gc2l0ZSBkbyBSZXBvc2l0w7NyaW8gRGlnaXRhbCBkYSBVRlBFIG5vIGVuZGVyZcOnbyBodHRwOi8vd3d3LnJlcG9zaXRvcmlvLnVmcGUuYnIsIHF1YW5kbyBmaW5kYXIgbyBwZXLDrW9kbyBkZSBlbWJhcmdvIGNvbmRpemVudGUgYW8gdGlwbyBkZSBkb2N1bWVudG8sIGNvbmZvcm1lIGluZGljYWRvIG5vIGNhbXBvIERhdGEgZGUgRW1iYXJnby4KRepositório InstitucionalPUBhttps://repositorio.ufpe.br/oai/requestattena@ufpe.bropendoar:22212022-07-05T05:18:47Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)false
dc.title.pt_BR.fl_str_mv A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
title A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
spellingShingle A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
MATTOS, Juliana Barcellos
Inteligência computacional
Mineração de modelos
title_short A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
title_full A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
title_fullStr A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
title_full_unstemmed A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
title_sort A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour
author MATTOS, Juliana Barcellos
author_facet MATTOS, Juliana Barcellos
author_role author
dc.contributor.authorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/7907615802587388
dc.contributor.advisorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/5736183954752317
dc.contributor.advisor-coLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/4610098557429398
dc.contributor.author.fl_str_mv MATTOS, Juliana Barcellos
dc.contributor.advisor1.fl_str_mv VIMIEIRO, Renato
dc.contributor.advisor-co1.fl_str_mv MATTOS NETO, Paulo Salgado Gomes de
contributor_str_mv VIMIEIRO, Renato
MATTOS NETO, Paulo Salgado Gomes de
dc.subject.por.fl_str_mv Inteligência computacional
Mineração de modelos
topic Inteligência computacional
Mineração de modelos
description A variety of works in the literature strive to uncover the factors associated with survival behaviour. However, the computational tools to provide such information are global models designed to predict if or when a (survival) event will occur. When addressing the problem of explaining differences in survival behaviour, those approaches rely on (assumptions of) predictive features followed by risk stratification. In other words, they lack the ability to discover local exceptionalities in the data and provide new information on factors related to survival. In this work, we aim at providing a computational tool to identify the different (unusual) survival responses that may occur in a population of individuals and provide straightforward information about the circumstances related to such responses. We approach such a problem from the perspective of supervised descriptive pattern mining to discover local patterns associated with different survival behaviours. Hence, we introduce an Exceptional Model Mining (EMM) framework to provide straightforward characterisations of subgroups presenting unusual survival models, given by the Kaplan-Meier estimates. In contrast to the greedy search heuristics prevalent among EMM approaches, we employ stochastic optimisation and introduce the first approach in the literature to explore the Ant-Colony Optimisation (ACO) meta-heuristics for the subgroup search. Thus, we tackle the problem of subgroup redundancy to provide a set of exceptional subgroups that are diverse in their descriptions, coverages and survival models. We conducted experiments on fourteen real-world data sets to assess the performance of our approach. In the results, we show that the framework presented is capable of discovering representative patterns with accurate unusual models and straightforward representations. Moreover, the discovered subgroups potentially capture survival behaviours existent in the data. The approach successfully tackles the problem of subgroup redundancy, providing a set of diverse (unique) exceptional (survival) subgroups. Our framework outperforms the other existent approaches to provide characterisations over unusual survival behaviours regarding the descriptive aspect of its results and diversity of its findings.
publishDate 2021
dc.date.issued.fl_str_mv 2021-12-10
dc.date.accessioned.fl_str_mv 2022-07-04T16:04:48Z
dc.date.available.fl_str_mv 2022-07-04T16:04:48Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv MATTOS, Juliana Barcellos. A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour. 2021. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Pernambuco, Recife, 2021.
dc.identifier.uri.fl_str_mv https://repositorio.ufpe.br/handle/123456789/44957
identifier_str_mv MATTOS, Juliana Barcellos. A supervised descriptive local pattern mining approach to the discovery of subgroups with exceptional survival behaviour. 2021. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Pernambuco, Recife, 2021.
url https://repositorio.ufpe.br/handle/123456789/44957
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de Pernambuco
dc.publisher.program.fl_str_mv Programa de Pos Graduacao em Ciencia da Computacao
dc.publisher.initials.fl_str_mv UFPE
dc.publisher.country.fl_str_mv Brasil
publisher.none.fl_str_mv Universidade Federal de Pernambuco
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFPE
instname:Universidade Federal de Pernambuco (UFPE)
instacron:UFPE
instname_str Universidade Federal de Pernambuco (UFPE)
instacron_str UFPE
institution UFPE
reponame_str Repositório Institucional da UFPE
collection Repositório Institucional da UFPE
bitstream.url.fl_str_mv https://repositorio.ufpe.br/bitstream/123456789/44957/1/DISSERTA%c3%87%c3%83O%20Juliana%20Barcellos%20Mattos.pdf
https://repositorio.ufpe.br/bitstream/123456789/44957/3/license.txt
https://repositorio.ufpe.br/bitstream/123456789/44957/4/DISSERTA%c3%87%c3%83O%20Juliana%20Barcellos%20Mattos.pdf.txt
https://repositorio.ufpe.br/bitstream/123456789/44957/5/DISSERTA%c3%87%c3%83O%20Juliana%20Barcellos%20Mattos.pdf.jpg
https://repositorio.ufpe.br/bitstream/123456789/44957/2/license_rdf
bitstream.checksum.fl_str_mv efc2197847f84611bb93b92f8c7acfab
6928b9260b07fb2755249a5ca9903395
dea8366722177619ebb9817d7b95e83c
ddd907414aafe749b081c991efb96144
e39d27027a6cc9cb039ad269a5db8e34
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)
repository.mail.fl_str_mv attena@ufpe.br
_version_ 1862741825602715648