Machine learning algorithms to solve statistical problems
| Ano de defesa: | 2020 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Dissertação |
| Tipo de acesso: | Acesso aberto |
| Idioma: | eng |
| Instituição de defesa: |
Não Informado pela instituição
|
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | http://www.repositorio.ufc.br/handle/riufc/56651 |
Resumo: | The field of artificial intelligence before its popularity, begin to be spread around the world as a computational tools from a distant future. In 1950, Artificial Neural Network (ANN) begin to be developed and all compute intelligence algorithms follow the same way , making that future be more closer now a days. Currently, machine learning, an AI branch, is used in many different fields and processes such as marketing and sales, business intelligence, research and development, supply chain, financial stocks, human resources, healthcare, etc. The maturation of AI field carries itself a probabilistic bases that can be used to solve some problems in statistics. In this work we make use of machine learning techniques and algorithms to solve two proposed statistical problems. In chapter 1, the issue is to find an approximation to normal cumulative distribution function. This expression needs to be mathematically and computationally simpler than other approximations founded in statistics lectures and papers and one possible use of this expression is in introductory statistics classrooms. Chapter 2 we address an identifiability distribution problem using machine learning algorithm and a framework for mathematical computation called Tensorflow and an abstraction library for deep learning routines called Keras, both of them written in Python. The main goal here is construct a structure that can be able to capture features from a sample provided by the user and classify the parent distribution of this sample. The results were promising with a accuracy greater then 95% for each distribution used for examples. |
| id |
UFC-7_0e9475abea92f5896480809b472c7c05 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufc.br:riufc/56651 |
| network_acronym_str |
UFC-7 |
| network_name_str |
Repositório Institucional da Universidade Federal do Ceará (UFC) |
| repository_id_str |
|
| spelling |
Souza, Fábio Hemerson Araújo deFreitas, Silvia Maria dePinho, Luis Gustavo Bastos2021-02-22T10:37:17Z2021-02-22T10:37:17Z2020SOUZA, Fábio Hemerson Araújo de. Machine learning algorithms to solve statistical problems. 2020. 45 f. Dissertação (Mestrado em Modelagem e Métodos Quantitativos) - Centro de Ciências, Universidade Federal do Ceará, Fortaleza, 2020.http://www.repositorio.ufc.br/handle/riufc/56651The field of artificial intelligence before its popularity, begin to be spread around the world as a computational tools from a distant future. In 1950, Artificial Neural Network (ANN) begin to be developed and all compute intelligence algorithms follow the same way , making that future be more closer now a days. Currently, machine learning, an AI branch, is used in many different fields and processes such as marketing and sales, business intelligence, research and development, supply chain, financial stocks, human resources, healthcare, etc. The maturation of AI field carries itself a probabilistic bases that can be used to solve some problems in statistics. In this work we make use of machine learning techniques and algorithms to solve two proposed statistical problems. In chapter 1, the issue is to find an approximation to normal cumulative distribution function. This expression needs to be mathematically and computationally simpler than other approximations founded in statistics lectures and papers and one possible use of this expression is in introductory statistics classrooms. Chapter 2 we address an identifiability distribution problem using machine learning algorithm and a framework for mathematical computation called Tensorflow and an abstraction library for deep learning routines called Keras, both of them written in Python. The main goal here is construct a structure that can be able to capture features from a sample provided by the user and classify the parent distribution of this sample. The results were promising with a accuracy greater then 95% for each distribution used for examples.Antes de sua popularidade o campo da inteligência artificial começou a se espalhar pelo mundo como uma ferramenta computacional de um futuro distante. Em 1950, as Redes Neurais Artificiais (RNA) começaram a ser desenvolvidas e todos os algoritmos de inteligência computacional seguiram o mesmo caminho, fazendo com que esse futuro ficasse cada vez mais próximo. Atualmente, o aprendizado de máquina, um ramo da IA, é usado em muitos campos e processos diferentes, como marketing e vendas, inteligência de negócios, pesquisa e desenvolvimento, cadeia de suprimentos, estoques financeiros, recursos humanos, saúde, etc. O amadurecimento da área de IA trás consigo uma base probabilística que pode ser usada para resolver alguns problemas em estatística. Neste trabalho fazemos uso de técnicas e algoritmos de aprendizado de máquina para resolver dois problemas estatísticos propostos. No capítulo 1, o problema é encontrar uma aproximação para a função de distribuição acumulada para distribuição Normal. Esta expressão precisa ser matemática e computacionalmente mais simples do que outras aproximações encontradas em palestras e artigos de estatística e um possível uso dessa expressão é em sala de aula em disciplinas de introdução à estatística. No capítulo 2 abordamos um problema de distribuição de identificabilidade usando algoritmo de aprendizado de máquina e uma estrutura para computação matemática chamada textit Tensorflow e uma biblioteca de abstração para rotinas de aprendizagem profunda chamada textit Keras, ambas escritas em Python. O objetivo principal aqui é construir uma estrutura que possa ser capaz de capturar características de uma amostra fornecida pelo usuário e classificar a distribuição original dessa amostra. Os resultados foram promissores com uma precisão superior a 95 % para cada distribuição usada nos exemplos.Inteligência artificialAprendizado de máquinaAlgoritmosPIPERedes neuraisArtificial inteligenceMachine learningAlgorithmsNeural NetworkMachine learning algorithms to solve statistical problemsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisengreponame:Repositório Institucional da Universidade Federal do Ceará (UFC)instname:Universidade Federal do Ceará (UFC)instacron:UFCinfo:eu-repo/semantics/openAccessORIGINAL2020_dis_fhasouza.pdf2020_dis_fhasouza.pdfapplication/pdf490489http://repositorio.ufc.br/bitstream/riufc/56651/7/2020_dis_fhasouza.pdf06a05828edc43c6fef31c4122e703d1dMD57LICENSElicense.txtlicense.txttext/plain; charset=utf-81893http://repositorio.ufc.br/bitstream/riufc/56651/8/license.txt4d8f4e989fd8622bc24a719aca4d64ceMD58riufc/566512021-02-22 07:37:17.472oai:repositorio.ufc.br:riufc/56651TElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTyBFWENMVVNJVkEgREEgVUZDDQoNCkFvIGNvbmNvcmRhciBlbSBjb25jZWRlciBlc3RhIGxpY2Vuw6dhLCB2b2PDqiAobyAocykgYXV0b3IgKGVzKSBvdSBjb3B5cmlnaHQgcHJvcHJpZXTDoXJpbykgY29uY2VkZSDDoCBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkbyBDZWFyw6EgKFVGQykgbyBkaXJlaXRvIG7Do28gZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsDQp0cmFkdXppciAoY29uZm9ybWUgZGVmaW5pZG8gYWJhaXhvKSBlIC8gb3UgZGlzdHJpYnVpciBzZXUgdHJhYmFsaG8gKGluY2x1aW5kbw0KbyByZXN1bW8pIGVtIHRvZG8gbyBtdW5kbyBlbSBmb3JtYXRvIGltcHJlc3NvIGUgZWxldHLDtG5pY28gZSBlbSBxdWFscXVlciBtZWlvLA0KaW5jbHVpbmRvLCBtYXMgbsOjbyBzZSBsaW1pdGFuZG8gYSwgw6F1ZGlvIG91IHbDrWRlby4NCg0KVm9jw6ogY29uY29yZGEgcXVlIGEgVUZDIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFkdXppciBvDQpzdWJtaXNzw6NvIGEgcXVhbHF1ZXIgbWVpbyBvdSBmb3JtYXRvIHBhcmEgZmlucyBkZSBwcmVzZXJ2YcOnw6NvLg0KDQpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhIFVGQyBwb2RlIG1hbnRlciBtYWlzIGRlIHVtYSBjw7NwaWEgZGVzdGUgdHJhYmFsaG8gcGFyYQ0KZmlucyBkZSBzZWd1cmFuw6dhLCBiYWNrdXAgZSBwcmVzZXJ2YcOnw6NvLg0KDQpWb2PDqiBkZWNsYXJhIHF1ZSBvIGRvY3VtZW50byBzdWJtZXRpZG8gYW8gcmVwb3NpdMOzcmlvIMOpIHNldSB0cmFiYWxobyBvcmlnaW5hbCBlIHF1ZSB2b2PDqiB0ZW0gbyBkaXJlaXRvIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIHNlIHJlc3BvbnNhYmlsaXphIGRlIHF1ZSBvIHNldSB0cmFiYWxobyBuw6NvIGluZnJpbmdlLCBhdMOpIG9uZGUgdm9jw6ogc2FiZSwgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uDQoNClNlIG8gdHJhYmFsaG8gY29udGl2ZXIgbWF0ZXJpYWwgc29icmUgbyBxdWFsIHZvY8OqIG7Do28gcG9zc3VpIGRpcmVpdG9zIGF1dG9yYWlzLA0Kdm9jw6ogZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc8OjbyBpcnJlc3RyaXRhIGRvIHByb3ByaWV0w6FyaW8gZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgYSBVRkMgb3MgZGlyZWl0b3MgZXhpZ2lkb3MgcG9yIGVzdGEgbGljZW7Dp2EsIGUgcXVlIHRhbCBtYXRlcmlhbCBkZSB0ZXJjZWlyb3Mgw6kgY2xhcmFtZW50ZSBpZGVudGlmaWNhZG8gZSByZWNvbmhlY2lkbw0KZGVudHJvIGRvIHRleHRvIG91IGNvbnRlw7pkbyBkYSBzdWJtaXNzw6NvLg0KDQpTZSBvIHRyYWJhbGhvIGRlcG9zaXRhZG8gZm9yIGJhc2VhZG8gZW0gdHJhYmFsaG8gcGF0cm9jaW5hZG8gb3UgYXBvaWFkbw0KcG9yIHVtYSBhZ8OqbmNpYSBvdSBvcmdhbml6YcOnw6NvIHF1ZSBuw6NvIHNlamEgYSBVRkMsIHZvY8OqIHNlIHJlc3BvbnNhYmlsaXphIHBvcg0KY3VtcHJpciBxdWFscXVlciBkaXJlaXRvIGRlIHJldmlzw6NvIG91IG91dHJhcyBvYnJpZ2HDp8O1ZXMgZXhpZ2lkYXMgcG9yIHRhaXMNCmNvbnRyYXRvIG91IGFjb3Jkby4NCg0KQSBVRkMgIGlkZW50aWZpY2Fyw6EgY2xhcmFtZW50ZSBzZXUgKHMpIG5vbWUgKHMpIGNvbW8gYXV0b3IgKGVzKSBvdSBwcm9wcmlldMOhcmlvIChzKSBkbw0KdHJhYmFsaG8gc3VibWV0aWRvLCBlIG7Do28gZmFyw6EgcXVhbHF1ZXIgYWx0ZXJhw6fDo28sIGV4Y2V0byBjb25mb3JtZSBwZXJtaXRpZG8gcG9yIGVzdGUNCmxpY2Vuw6dhLCBxdWUgZXN0w6Egc2VuZG8gYXF1aSBhcHJlc2VudGFkYS4NCg0KQ29vcmRlbmHDp8OjbyBkbyBSZXBvc2l0w7NyaW8gSW5zdGl0dWNpb25hbCBkYSBVRkMNCg0KRepositório InstitucionalPUBhttp://www.repositorio.ufc.br/ri-oai/requestbu@ufc.br || repositorio@ufc.bropendoar:2021-02-22T10:37:17Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC)false |
| dc.title.pt_BR.fl_str_mv |
Machine learning algorithms to solve statistical problems |
| title |
Machine learning algorithms to solve statistical problems |
| spellingShingle |
Machine learning algorithms to solve statistical problems Souza, Fábio Hemerson Araújo de Inteligência artificial Aprendizado de máquina Algoritmos PIPE Redes neurais Artificial inteligence Machine learning Algorithms Neural Network |
| title_short |
Machine learning algorithms to solve statistical problems |
| title_full |
Machine learning algorithms to solve statistical problems |
| title_fullStr |
Machine learning algorithms to solve statistical problems |
| title_full_unstemmed |
Machine learning algorithms to solve statistical problems |
| title_sort |
Machine learning algorithms to solve statistical problems |
| author |
Souza, Fábio Hemerson Araújo de |
| author_facet |
Souza, Fábio Hemerson Araújo de |
| author_role |
author |
| dc.contributor.co-advisor.none.fl_str_mv |
Freitas, Silvia Maria de |
| dc.contributor.author.fl_str_mv |
Souza, Fábio Hemerson Araújo de |
| dc.contributor.advisor1.fl_str_mv |
Pinho, Luis Gustavo Bastos |
| contributor_str_mv |
Pinho, Luis Gustavo Bastos |
| dc.subject.por.fl_str_mv |
Inteligência artificial Aprendizado de máquina Algoritmos PIPE Redes neurais Artificial inteligence Machine learning Algorithms Neural Network |
| topic |
Inteligência artificial Aprendizado de máquina Algoritmos PIPE Redes neurais Artificial inteligence Machine learning Algorithms Neural Network |
| description |
The field of artificial intelligence before its popularity, begin to be spread around the world as a computational tools from a distant future. In 1950, Artificial Neural Network (ANN) begin to be developed and all compute intelligence algorithms follow the same way , making that future be more closer now a days. Currently, machine learning, an AI branch, is used in many different fields and processes such as marketing and sales, business intelligence, research and development, supply chain, financial stocks, human resources, healthcare, etc. The maturation of AI field carries itself a probabilistic bases that can be used to solve some problems in statistics. In this work we make use of machine learning techniques and algorithms to solve two proposed statistical problems. In chapter 1, the issue is to find an approximation to normal cumulative distribution function. This expression needs to be mathematically and computationally simpler than other approximations founded in statistics lectures and papers and one possible use of this expression is in introductory statistics classrooms. Chapter 2 we address an identifiability distribution problem using machine learning algorithm and a framework for mathematical computation called Tensorflow and an abstraction library for deep learning routines called Keras, both of them written in Python. The main goal here is construct a structure that can be able to capture features from a sample provided by the user and classify the parent distribution of this sample. The results were promising with a accuracy greater then 95% for each distribution used for examples. |
| publishDate |
2020 |
| dc.date.issued.fl_str_mv |
2020 |
| dc.date.accessioned.fl_str_mv |
2021-02-22T10:37:17Z |
| dc.date.available.fl_str_mv |
2021-02-22T10:37:17Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.citation.fl_str_mv |
SOUZA, Fábio Hemerson Araújo de. Machine learning algorithms to solve statistical problems. 2020. 45 f. Dissertação (Mestrado em Modelagem e Métodos Quantitativos) - Centro de Ciências, Universidade Federal do Ceará, Fortaleza, 2020. |
| dc.identifier.uri.fl_str_mv |
http://www.repositorio.ufc.br/handle/riufc/56651 |
| identifier_str_mv |
SOUZA, Fábio Hemerson Araújo de. Machine learning algorithms to solve statistical problems. 2020. 45 f. Dissertação (Mestrado em Modelagem e Métodos Quantitativos) - Centro de Ciências, Universidade Federal do Ceará, Fortaleza, 2020. |
| url |
http://www.repositorio.ufc.br/handle/riufc/56651 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da Universidade Federal do Ceará (UFC) instname:Universidade Federal do Ceará (UFC) instacron:UFC |
| instname_str |
Universidade Federal do Ceará (UFC) |
| instacron_str |
UFC |
| institution |
UFC |
| reponame_str |
Repositório Institucional da Universidade Federal do Ceará (UFC) |
| collection |
Repositório Institucional da Universidade Federal do Ceará (UFC) |
| bitstream.url.fl_str_mv |
http://repositorio.ufc.br/bitstream/riufc/56651/7/2020_dis_fhasouza.pdf http://repositorio.ufc.br/bitstream/riufc/56651/8/license.txt |
| bitstream.checksum.fl_str_mv |
06a05828edc43c6fef31c4122e703d1d 4d8f4e989fd8622bc24a719aca4d64ce |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da Universidade Federal do Ceará (UFC) - Universidade Federal do Ceará (UFC) |
| repository.mail.fl_str_mv |
bu@ufc.br || repositorio@ufc.br |
| _version_ |
1847793263831941120 |