Geração automática de fluxos de tarefas para problemas de aprendizado de máquina
| Ano de defesa: | 2018 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Dissertação |
| Tipo de acesso: | Acesso aberto |
| Idioma: | por |
| Instituição de defesa: |
Universidade Federal de Minas Gerais
|
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Link de acesso: | https://hdl.handle.net/1843/30744 |
Resumo: | Automatic Machine Learning is a growing area of machine learning that has a similar objective to the area of hyper-heuristics: to automatically recommend optimized pipelines, algorithms or appropriate parameters to specific tasks without much dependency on user knowledge. The background knowledge required to solve the task at hand is actually embedded into a search mechanism that builds personalized solutions to the task. Following this idea, this thesis proposes RECIPE (REsilient ClassifIcation Pipeline Evolution), a framework based on grammar-based genetic programming that builds customized classification pipelines. The framework is flexible enough to receive different grammars and can be easily extended to other machine learning tasks. RECIPE overcomes the drawbacks of previous evolutionary-based frameworks, such as generating invalid individuals, and organizes a high number of possible suitable data pre-processing and classification methods into a grammar. Results of f-measure obtained by RECIPE are compared to those two state-of-the-art methods, and shown to be as good as or better than those previously reported in the literature. RECIPE represents a first step towards a complete framework for dealing with different machine learning tasks with the minimum required human intervention. |
| id |
UFMG_8defce7a6e92ad08e131cfcd1aaa7f90 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufmg.br:1843/30744 |
| network_acronym_str |
UFMG |
| network_name_str |
Repositório Institucional da UFMG |
| repository_id_str |
|
| spelling |
2019-10-31T13:03:28Z2025-09-08T23:47:10Z2019-10-31T13:03:28Z2018-08-06https://hdl.handle.net/1843/30744Automatic Machine Learning is a growing area of machine learning that has a similar objective to the area of hyper-heuristics: to automatically recommend optimized pipelines, algorithms or appropriate parameters to specific tasks without much dependency on user knowledge. The background knowledge required to solve the task at hand is actually embedded into a search mechanism that builds personalized solutions to the task. Following this idea, this thesis proposes RECIPE (REsilient ClassifIcation Pipeline Evolution), a framework based on grammar-based genetic programming that builds customized classification pipelines. The framework is flexible enough to receive different grammars and can be easily extended to other machine learning tasks. RECIPE overcomes the drawbacks of previous evolutionary-based frameworks, such as generating invalid individuals, and organizes a high number of possible suitable data pre-processing and classification methods into a grammar. Results of f-measure obtained by RECIPE are compared to those two state-of-the-art methods, and shown to be as good as or better than those previously reported in the literature. RECIPE represents a first step towards a complete framework for dealing with different machine learning tasks with the minimum required human intervention.porUniversidade Federal de Minas GeraisAprendizado de Máquina AutomáticoRECIPEFluxos de tarefasGeração automática de fluxos de tarefas para problemas de aprendizado de máquinainfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisWalter José Gonçalves da Silva Pintoinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGhttp://lattes.cnpq.br/6668433953595024Gisele Lobo Pappahttp://lattes.cnpq.br/5936682335701497Gisele Lobo PappaAna Paula Couto da SilvaLuiz Henrique Zárate GálvezSandro Carvalho IzidoroA área de Aprendizado de Máquina Automático tem como objetivo recomendar automaticamente fluxos de tarefas que devem ser seguidas para criar algoritmos de aprendizado personalizados para uma dada base de dados. Essas tarefas incluem métodos de pré-processamentodedados, algoritmosdeaprendizadoeseusparâmetrosetécnicasde pós-processamento. Agrandevantagemdosmétodosdessaáreaestáemsuacapacidade de gerar fluxos sem dependência de conhecimento especializado do usuário realizando a tarefa. Esta dissertação propõe o RECIPE (REsilient ClassifIcation Pipeline Evolution), um método que faz uso de programação genética baseada em gramática para buscar por esses fluxos de tarefa considerando problemas de classificação. O RECIPE é flexível o suficiente para receber diferentes gramáticas e pode ser facilmente estendido para outras tarefas de aprendizado. Os resultados da medida F1 obtidos pelo RECIPE em 10 bases de dados são comparados a dois métodos estado da arte nessa tarefa, e são tão bons ou melhores do que os relatados anteriormente na literatura.BrasilPrograma de Pós-Graduação em Ciência da ComputaçãoUFMGLICENSElicense.txttext/plain2119https://repositorio.ufmg.br//bitstreams/5a75ff15-5e4e-42bd-8f47-b5a5b798559b/download34badce4be7e31e3adb4575ae96af679MD51falseAnonymousREADORIGINALWalterJoseGoncalvesdaSilvaPinto.pdfapplication/pdf2498315https://repositorio.ufmg.br//bitstreams/762496b8-51ae-43d0-8a0d-27584a31dc91/downloadfd2336065d97e34fb64faeb6556fd75cMD52trueAnonymousREADTEXTWalterJoseGoncalvesdaSilvaPinto.pdf.txttext/plain147596https://repositorio.ufmg.br//bitstreams/32d21bb1-55d7-4605-967f-e72637e0ded2/download84b81aed64aaa3fee658244bba4ad5a0MD53falseAnonymousREADTHUMBNAILWalterJoseGoncalvesdaSilvaPinto.pdf.jpgWalterJoseGoncalvesdaSilvaPinto.pdf.jpgGenerated Thumbnailimage/jpeg2398https://repositorio.ufmg.br//bitstreams/febe21b9-5be7-46bf-8926-927837972afa/download51842fcb101106d2991966943c9662e4MD54falseAnonymousREAD1843/307442025-09-10 13:53:45.563open.accessoai:repositorio.ufmg.br:1843/30744https://repositorio.ufmg.br/Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2025-09-10T16:53:45Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)falseTElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTy1FWENMVVNJVkEgRE8gUkVQT1NJVMOTUklPIElOU1RJVFVDSU9OQUwgREEgVUZNRwoKQ29tIGEgYXByZXNlbnRhw6fDo28gZGVzdGEgbGljZW7Dp2EsIHZvY8OqIChvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvcikgY29uY2VkZSBhbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIChSSS1VRk1HKSBvIGRpcmVpdG8gbsOjbyBleGNsdXNpdm8gZSBpcnJldm9nw6F2ZWwgZGUgcmVwcm9kdXppciBlL291IGRpc3RyaWJ1aXIgYSBzdWEgcHVibGljYcOnw6NvIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyw7RuaWNvIGUgZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBkZWNsYXJhIHF1ZSBjb25oZWNlIGEgcG9sw610aWNhIGRlIGNvcHlyaWdodCBkYSBlZGl0b3JhIGRvIHNldSBkb2N1bWVudG8gZSBxdWUgY29uaGVjZSBlIGFjZWl0YSBhcyBEaXJldHJpemVzIGRvIFJJLVVGTUcuCgpWb2PDqiBjb25jb3JkYSBxdWUgbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSBwdWJsaWNhw6fDo28gcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGTUcgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSBwdWJsaWNhw6fDo28gcGFyYSBmaW5zIGRlIHNlZ3VyYW7Dp2EsIGJhY2stdXAgZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogZGVjbGFyYSBxdWUgYSBzdWEgcHVibGljYcOnw6NvIMOpIG9yaWdpbmFsIGUgcXVlIHZvY8OqIHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIG8gZGVww7NzaXRvIGRlIHN1YSBwdWJsaWNhw6fDo28gbsOjbywgcXVlIHNlamEgZGUgc2V1IGNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHB1YmxpY2HDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiBkZWNsYXJhIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgYW8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHB1YmxpY2HDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBQVUJMSUNBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PLCBWT0PDiiBERUNMQVJBIFFVRSBSRVNQRUlUT1UgVE9ET1MgRSBRVUFJU1FVRVIgRElSRUlUT1MgREUgUkVWSVPDg08gQ09NTyBUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCk8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lKHMpIG91IG8ocykgbm9tZXMocykgZG8ocykgZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSBwdWJsaWNhw6fDo28sIGUgbsOjbyBmYXLDoSBxdWFscXVlciBhbHRlcmHDp8OjbywgYWzDqW0gZGFxdWVsYXMgY29uY2VkaWRhcyBwb3IgZXN0YSBsaWNlbsOnYS4KCg== |
| dc.title.none.fl_str_mv |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| title |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| spellingShingle |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina Walter José Gonçalves da Silva Pinto Aprendizado de Máquina Automático RECIPE Fluxos de tarefas |
| title_short |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| title_full |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| title_fullStr |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| title_full_unstemmed |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| title_sort |
Geração automática de fluxos de tarefas para problemas de aprendizado de máquina |
| author |
Walter José Gonçalves da Silva Pinto |
| author_facet |
Walter José Gonçalves da Silva Pinto |
| author_role |
author |
| dc.contributor.author.fl_str_mv |
Walter José Gonçalves da Silva Pinto |
| dc.subject.other.none.fl_str_mv |
Aprendizado de Máquina Automático RECIPE Fluxos de tarefas |
| topic |
Aprendizado de Máquina Automático RECIPE Fluxos de tarefas |
| description |
Automatic Machine Learning is a growing area of machine learning that has a similar objective to the area of hyper-heuristics: to automatically recommend optimized pipelines, algorithms or appropriate parameters to specific tasks without much dependency on user knowledge. The background knowledge required to solve the task at hand is actually embedded into a search mechanism that builds personalized solutions to the task. Following this idea, this thesis proposes RECIPE (REsilient ClassifIcation Pipeline Evolution), a framework based on grammar-based genetic programming that builds customized classification pipelines. The framework is flexible enough to receive different grammars and can be easily extended to other machine learning tasks. RECIPE overcomes the drawbacks of previous evolutionary-based frameworks, such as generating invalid individuals, and organizes a high number of possible suitable data pre-processing and classification methods into a grammar. Results of f-measure obtained by RECIPE are compared to those two state-of-the-art methods, and shown to be as good as or better than those previously reported in the literature. RECIPE represents a first step towards a complete framework for dealing with different machine learning tasks with the minimum required human intervention. |
| publishDate |
2018 |
| dc.date.issued.fl_str_mv |
2018-08-06 |
| dc.date.accessioned.fl_str_mv |
2019-10-31T13:03:28Z 2025-09-08T23:47:10Z |
| dc.date.available.fl_str_mv |
2019-10-31T13:03:28Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/1843/30744 |
| url |
https://hdl.handle.net/1843/30744 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
| publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFMG instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG |
| instname_str |
Universidade Federal de Minas Gerais (UFMG) |
| instacron_str |
UFMG |
| institution |
UFMG |
| reponame_str |
Repositório Institucional da UFMG |
| collection |
Repositório Institucional da UFMG |
| bitstream.url.fl_str_mv |
https://repositorio.ufmg.br//bitstreams/5a75ff15-5e4e-42bd-8f47-b5a5b798559b/download https://repositorio.ufmg.br//bitstreams/762496b8-51ae-43d0-8a0d-27584a31dc91/download https://repositorio.ufmg.br//bitstreams/32d21bb1-55d7-4605-967f-e72637e0ded2/download https://repositorio.ufmg.br//bitstreams/febe21b9-5be7-46bf-8926-927837972afa/download |
| bitstream.checksum.fl_str_mv |
34badce4be7e31e3adb4575ae96af679 fd2336065d97e34fb64faeb6556fd75c 84b81aed64aaa3fee658244bba4ad5a0 51842fcb101106d2991966943c9662e4 |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG) |
| repository.mail.fl_str_mv |
repositorio@ufmg.br |
| _version_ |
1862105634804072448 |