Network-based high level classification: novel models and applications
| Ano de defesa: | 2021 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Tese |
| Tipo de acesso: | Acesso aberto |
| Idioma: | eng |
| Instituição de defesa: |
Biblioteca Digitais de Teses e Dissertações da USP
|
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | https://www.teses.usp.br/teses/disponiveis/55/55134/tde-26032021-102400/ |
Resumo: | Machine learning is an application of artificial intelligence with focus on the development of computer programs that can access data and use them to learn for themselves. High level data classification is a technique based on data pattern formation, instead of only their physical features. Complex networks have been proven to be quite useful for characterizing relationships among data samples and, consequently, they are a powerful mechanism to capture data patterns. In this work, we investigate novel ways of using the network-based approach in the development of high level classification techniques. Initially, two classification techniques are introduced, and their performances are assessed by applying them to benchmark datasets, both artificial and real, as well as comparing their results to those achieved by traditional classification models, on the same data. Afterwards, we explore the inherent advantages offered by this type of approach, such as its versatility and interpretability, by developing novel network-based techniques specifically designed to be applied on data concerning real and relevant problems from very diverse fields, from the financial market to corruption among politicians and healthcare. Although these type of applications certainly require a greater amount of effort from the part of researchers, in terms of the challenge and data preprocessing, we believe they are important to bring academic research closer to the reality. Among our findings, there is the uncovering of an unexpected relationship between legislative voting data and convictions for corruption or other financial crimes among Brazilian representatives. We also demonstrate how one can adapt a model, which originally has been applied to detect periodicity in meteorological data, for identifying up and down trends in the stock market, automatically triggering a buying or a selling order for the asset, accordingly. In another investigation, a technique to help healthcare workers in the task of monitoring COVID-19 patients is presented, by detecting early signs of hepatic, renal or respiratory insufficiency solely based on Complete Blood Count (CBC) test results. In summary, we believe this work makes an important contribution to the advance of large scale public data study using complex networks. |
| id |
USP_e42c84360d27b9766127ec5875fd49cb |
|---|---|
| oai_identifier_str |
oai:teses.usp.br:tde-26032021-102400 |
| network_acronym_str |
USP |
| network_name_str |
Biblioteca Digital de Teses e Dissertações da USP |
| repository_id_str |
|
| spelling |
Network-based high level classification: novel models and applicationsClassificação de alto nível baseada em redes: novos modelos e aplicaçõesAprendizado de máquinaAutomação de investimentosBills votingCBC testClassificação de dados de alto nívelComplex networksCorruption predictionCOVID- 19COVID-19Detecção de insuficiênciaHemogramaHigh level data classificationInsufficiency detectionMachine learningMercado de açõesPartidos políticosPolitical partiesPredição de corrupçãoRedes complexasStock marketStock trading automationVotações legislativasMachine learning is an application of artificial intelligence with focus on the development of computer programs that can access data and use them to learn for themselves. High level data classification is a technique based on data pattern formation, instead of only their physical features. Complex networks have been proven to be quite useful for characterizing relationships among data samples and, consequently, they are a powerful mechanism to capture data patterns. In this work, we investigate novel ways of using the network-based approach in the development of high level classification techniques. Initially, two classification techniques are introduced, and their performances are assessed by applying them to benchmark datasets, both artificial and real, as well as comparing their results to those achieved by traditional classification models, on the same data. Afterwards, we explore the inherent advantages offered by this type of approach, such as its versatility and interpretability, by developing novel network-based techniques specifically designed to be applied on data concerning real and relevant problems from very diverse fields, from the financial market to corruption among politicians and healthcare. Although these type of applications certainly require a greater amount of effort from the part of researchers, in terms of the challenge and data preprocessing, we believe they are important to bring academic research closer to the reality. Among our findings, there is the uncovering of an unexpected relationship between legislative voting data and convictions for corruption or other financial crimes among Brazilian representatives. We also demonstrate how one can adapt a model, which originally has been applied to detect periodicity in meteorological data, for identifying up and down trends in the stock market, automatically triggering a buying or a selling order for the asset, accordingly. In another investigation, a technique to help healthcare workers in the task of monitoring COVID-19 patients is presented, by detecting early signs of hepatic, renal or respiratory insufficiency solely based on Complete Blood Count (CBC) test results. In summary, we believe this work makes an important contribution to the advance of large scale public data study using complex networks.Aprendizado de máquina é uma aplicação da inteligência artificial com foco no desenvolvimento de programas de computador que podem acessar dados e usá-los para aprender por conta própria. Classificação de dados de alto nivel é uma técnica baseada na formação de padrão nos dados, ao invés de somente nas suas características físicas. Redes complexas têm se mostrado bastante úteis para caracterizar relacionamentos entre amostras de dados e, conseqüentemente, são um poderoso mecanismo de captura de padrões de dados. Neste trabalho, são investigadas novas maneiras de se usar a abordagem baseada em rede no desenvolvimento de técnicas de classificação de alto nível. Inicialmente, duas técnicas de classificação são introduzidas, e seus desempenhos são avaliados aplicando-as a conjuntos de dados de referência na área, tanto artificiais quanto reais, bem como comparando seus resultados com aqueles obtidos por modelos de classificação tradicionais, nos mesmos dados. Posteriormente, são exploradas as vantagens inerentes a este tipo de abordagem, tais como a sua versatilidade e interpretabilidade, para se desenvolver novas técnicas baseadas em rede especificamente projetadas para serem aplicadas em dados de problemas reais e relevantes em campos muito diversos, desde o mercado financeiro à corrupção de políticos e cuidados de saúde. Embora estes tipos de aplicação certamente requerem um esforço maior por parte dos pesquisadores, em termos do desafio e pré-processamento dos dados, acredita-se que elas são importantes para aproximar a pesquisa acadêmica da realidade. Entre os resultados obtidos neste trabalho, está a detecção de uma relação não esperada entre dados de votação de projetos de lei e condenações por corrupção e outros crimes financeiros entre deputados brasileiros. Também é demonstrado como é possível adaptar um modelo, que originalmente foi aplicado na detecção de periodicidade em dados meteorológicos, para identificar tendências de alta e de baixa no mercado de ações, acionando automaticamente uma ordem de compra ou de venda para o ativo, de acordo com a situação. Em outra investigação, é apresentada uma técnica para auxiliar os profissionais de saúde na tarefa de monitorar pacientes com COVID-19, por meio da detecção de sinais prévios de insuficiência hepática, renal ou respiratória, apenas com base nos resultados do exame de hemograma completo. Em resumo, acredita-se que este trabalho faz uma importante contribuição para o avanço do estudo de dados públicos em larga escala usando redes complexas.Biblioteca Digitais de Teses e Dissertações da USPLiang, ZhaoColliri, Tiago Santos2021-01-29info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/55/55134/tde-26032021-102400/reponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USPLiberar o conteúdo para acesso público.info:eu-repo/semantics/openAccesseng2021-03-26T16:39:03Zoai:teses.usp.br:tde-26032021-102400Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.bropendoar:27212021-03-26T16:39:03Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false |
| dc.title.none.fl_str_mv |
Network-based high level classification: novel models and applications Classificação de alto nível baseada em redes: novos modelos e aplicações |
| title |
Network-based high level classification: novel models and applications |
| spellingShingle |
Network-based high level classification: novel models and applications Colliri, Tiago Santos Aprendizado de máquina Automação de investimentos Bills voting CBC test Classificação de dados de alto nível Complex networks Corruption prediction COVID- 19 COVID-19 Detecção de insuficiência Hemograma High level data classification Insufficiency detection Machine learning Mercado de ações Partidos políticos Political parties Predição de corrupção Redes complexas Stock market Stock trading automation Votações legislativas |
| title_short |
Network-based high level classification: novel models and applications |
| title_full |
Network-based high level classification: novel models and applications |
| title_fullStr |
Network-based high level classification: novel models and applications |
| title_full_unstemmed |
Network-based high level classification: novel models and applications |
| title_sort |
Network-based high level classification: novel models and applications |
| author |
Colliri, Tiago Santos |
| author_facet |
Colliri, Tiago Santos |
| author_role |
author |
| dc.contributor.none.fl_str_mv |
Liang, Zhao |
| dc.contributor.author.fl_str_mv |
Colliri, Tiago Santos |
| dc.subject.por.fl_str_mv |
Aprendizado de máquina Automação de investimentos Bills voting CBC test Classificação de dados de alto nível Complex networks Corruption prediction COVID- 19 COVID-19 Detecção de insuficiência Hemograma High level data classification Insufficiency detection Machine learning Mercado de ações Partidos políticos Political parties Predição de corrupção Redes complexas Stock market Stock trading automation Votações legislativas |
| topic |
Aprendizado de máquina Automação de investimentos Bills voting CBC test Classificação de dados de alto nível Complex networks Corruption prediction COVID- 19 COVID-19 Detecção de insuficiência Hemograma High level data classification Insufficiency detection Machine learning Mercado de ações Partidos políticos Political parties Predição de corrupção Redes complexas Stock market Stock trading automation Votações legislativas |
| description |
Machine learning is an application of artificial intelligence with focus on the development of computer programs that can access data and use them to learn for themselves. High level data classification is a technique based on data pattern formation, instead of only their physical features. Complex networks have been proven to be quite useful for characterizing relationships among data samples and, consequently, they are a powerful mechanism to capture data patterns. In this work, we investigate novel ways of using the network-based approach in the development of high level classification techniques. Initially, two classification techniques are introduced, and their performances are assessed by applying them to benchmark datasets, both artificial and real, as well as comparing their results to those achieved by traditional classification models, on the same data. Afterwards, we explore the inherent advantages offered by this type of approach, such as its versatility and interpretability, by developing novel network-based techniques specifically designed to be applied on data concerning real and relevant problems from very diverse fields, from the financial market to corruption among politicians and healthcare. Although these type of applications certainly require a greater amount of effort from the part of researchers, in terms of the challenge and data preprocessing, we believe they are important to bring academic research closer to the reality. Among our findings, there is the uncovering of an unexpected relationship between legislative voting data and convictions for corruption or other financial crimes among Brazilian representatives. We also demonstrate how one can adapt a model, which originally has been applied to detect periodicity in meteorological data, for identifying up and down trends in the stock market, automatically triggering a buying or a selling order for the asset, accordingly. In another investigation, a technique to help healthcare workers in the task of monitoring COVID-19 patients is presented, by detecting early signs of hepatic, renal or respiratory insufficiency solely based on Complete Blood Count (CBC) test results. In summary, we believe this work makes an important contribution to the advance of large scale public data study using complex networks. |
| publishDate |
2021 |
| dc.date.none.fl_str_mv |
2021-01-29 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
| format |
doctoralThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://www.teses.usp.br/teses/disponiveis/55/55134/tde-26032021-102400/ |
| url |
https://www.teses.usp.br/teses/disponiveis/55/55134/tde-26032021-102400/ |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
|
| dc.rights.driver.fl_str_mv |
Liberar o conteúdo para acesso público. info:eu-repo/semantics/openAccess |
| rights_invalid_str_mv |
Liberar o conteúdo para acesso público. |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.coverage.none.fl_str_mv |
|
| dc.publisher.none.fl_str_mv |
Biblioteca Digitais de Teses e Dissertações da USP |
| publisher.none.fl_str_mv |
Biblioteca Digitais de Teses e Dissertações da USP |
| dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da USP instname:Universidade de São Paulo (USP) instacron:USP |
| instname_str |
Universidade de São Paulo (USP) |
| instacron_str |
USP |
| institution |
USP |
| reponame_str |
Biblioteca Digital de Teses e Dissertações da USP |
| collection |
Biblioteca Digital de Teses e Dissertações da USP |
| repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP) |
| repository.mail.fl_str_mv |
virginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.br |
| _version_ |
1865491339672551424 |