Aplicações de estatística multivariada em análise de dados experimentais

Detalhes bibliográficos
Ano de defesa: 2023
Autor(a) principal: Zanardo, Bruno Felipe lattes
Orientador(a): Melo, Cássius Anderson Miquele De lattes
Banca de defesa: Melo, Iara Tosta E, Valdiviesso, Gustavo Do Amaral
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Alfenas
Programa de Pós-Graduação: Programa de Pós-graduação em Física
Departamento: Instituto de Ciência e Tecnologia
País: Brasil
Palavras-chave em Português:
Área do conhecimento CNPq:
Link de acesso: https://repositorio.unifal-mg.edu.br/handle/123456789/2420
Resumo: Multivariate statistics is a branch of statistics responsible for studying situations where statistics models presents more then one variable and their methods can be applied in the most diverse areas of knowledge assisting in decision making, because their methods have as main benefits the reduction of dimensionality of the model studied, making it less complex, in addition to being used in the construction of indexes, classification, association between variables and statistical inference. In this work Multivariate statistics methods were applied in tree different situations, the method of canonical correlation analysis was applied in two different situations and the principal component analysis with experimental errors method was applied in the last one. the first application referring to a socio-environmental analysis where the existence of a correlation between the human development index (HDI) and its sub-indices in relation to water consumption and sewage generation of Brazilian cities was analyzed. While the second analysis is related to high-energy physics involving the collision of heavy lead ions Pb-Pb. The third situation refers to the application of principal component analysis for the dimensionality reduction of a model of characterization of the interstellar medium. As a result, it was possible to generate a model capable of correlating the HDI with water consumption and sewage generation, with a canonical correlation of 62.4%, capable of representing the whole country, and a second model, directed only to the state of Sa˜o Paulo, with a canonical correlation of 83%. For the second scenario, involving the collision of heavy ions a canonical correlation of 99.9% was obtained, confirming the existing correlation between entropy and the number of charged particles. A second correlation, calculated from the second pair of canonical variables, returned a high correlation with 96%, however, in this model the crosssectional moment had the highest canonical weight, and can be calculated from the other variables studied such as centrality, energy and entropy. Regarding the principal component analysis, it was possible to reduce the number of variables used in the explanation of the model significantly, from 23 original variables to 8 main components, in addition to identifying that when considering the experimental errors during the analysis we obtain greater security regarding the number of variables used to explain the model.
id UNIFAL_7178bb9f581304d2aef526f9e4f8bf78
oai_identifier_str oai:repositorio.unifal-mg.edu.br:123456789/2420
network_acronym_str UNIFAL
network_name_str Repositório Institucional da Universidade Federal de Alfenas - RiUnifal
repository_id_str
spelling Zanardo, Bruno Felipehttp://lattes.cnpq.br/4002033080997386Melo, Iara Tosta EValdiviesso, Gustavo Do AmaralMelo, Cássius Anderson Miquele Dehttp://lattes.cnpq.br/52716442803350802024-07-17T20:13:01Z2023-08-25ZANARDO, Bruno Felipe. Aplicações de estatística multivariada em análise de dados experimentais. 2024. 100 f. Dissertação (Mestrado em Física) - Universidade Federal de Alfenas, Poços de Caldas, MG, 2023.https://repositorio.unifal-mg.edu.br/handle/123456789/2420Multivariate statistics is a branch of statistics responsible for studying situations where statistics models presents more then one variable and their methods can be applied in the most diverse areas of knowledge assisting in decision making, because their methods have as main benefits the reduction of dimensionality of the model studied, making it less complex, in addition to being used in the construction of indexes, classification, association between variables and statistical inference. In this work Multivariate statistics methods were applied in tree different situations, the method of canonical correlation analysis was applied in two different situations and the principal component analysis with experimental errors method was applied in the last one. the first application referring to a socio-environmental analysis where the existence of a correlation between the human development index (HDI) and its sub-indices in relation to water consumption and sewage generation of Brazilian cities was analyzed. While the second analysis is related to high-energy physics involving the collision of heavy lead ions Pb-Pb. The third situation refers to the application of principal component analysis for the dimensionality reduction of a model of characterization of the interstellar medium. As a result, it was possible to generate a model capable of correlating the HDI with water consumption and sewage generation, with a canonical correlation of 62.4%, capable of representing the whole country, and a second model, directed only to the state of Sa˜o Paulo, with a canonical correlation of 83%. For the second scenario, involving the collision of heavy ions a canonical correlation of 99.9% was obtained, confirming the existing correlation between entropy and the number of charged particles. A second correlation, calculated from the second pair of canonical variables, returned a high correlation with 96%, however, in this model the crosssectional moment had the highest canonical weight, and can be calculated from the other variables studied such as centrality, energy and entropy. Regarding the principal component analysis, it was possible to reduce the number of variables used in the explanation of the model significantly, from 23 original variables to 8 main components, in addition to identifying that when considering the experimental errors during the analysis we obtain greater security regarding the number of variables used to explain the model.A estatística multivariada é um ramo da estatística responsável por estudar situações em que se tem múltiplas variáveis e seus métodos podem ser aplicados nas mais diversas áreas do conhecimento auxiliando na tomada de decisão, isso porque seus métodos possuem como principais benefícios a redução de dimensionalidade do modelo estudado, tornando-o menos complexo, além de serem utilizados na construção de ´índices, classificação, associação entre as variáveis e inferência estatística. Neste trabalho, métodos de estatística multivariada foram aplicados em três situações distintas. Nas duas primeiras foi aplicado o método de correlação canônica e na terceira o método de análise de componentes principais (PCA) com incertezas experimentais. Sendo a primeira aplicação referente a uma análise socioambiental onde foi analisado a existência de uma correlação entre o índice de desenvolvimento humano (IDH) e seus subíndices com relação ao consumo de água e geração de esgoto dos municípios brasileiros. Enquanto a segunda análise está relacionada à Física de altas energias envolvendo a colisão de íons pesados de chumbo Pb-Pb. A terceira situação é referente a aplicação de PCA para a redução de dimensionalidade de um modelo de caracterização do meio interestelar. Como resultado foi possível gerar um modelo capaz de correlacionar o IDH com o consumo de água e a geração de esgoto com uma correlação canônica de 62,4% capaz de representar todo o país e um segundo modelo direcionado apenas para o estado de São Paulo com uma correlação canônica de 83%. Para o segundo cenário, envolvendo a colisão de íons pesados uma correlação canônica de 99,9% foi obtida, ratificando a correlação existente entre a entropia e o número de partículas carregadas. Uma segunda correlação proveniente do segundo par de variáveis canônicas retornou uma correlação também elevada com 96%, porém, neste modelo o momento transversal ficou com o maior peso canônico, podendo ser calculado a partir das demais variáveis estudadas como centralidade, energia e entropia. Com relação a análise de componentes principais foi possível reduzir o número de variáveis utilizadas na explicação do modelo de forma significativa, passando de 23 variáveis originais para 8 componentes principais, além de identificar que ao considerar a incerteza experimental durante a análise obtemos maior segurança com relação ao número de variáveis utilizadas para explicar o modelo.application/pdfporUniversidade Federal de AlfenasPrograma de Pós-graduação em FísicaUNIFAL-MGBrasilInstituto de Ciência e Tecnologiainfo:eu-repo/semantics/openAccessestatística multivariada.correlação canônica.componentes principais.análise socioambiental.física de altas energias.meio interestelar.FISICA::FISICA DAS PARTICULAS ELEMENTARES E CAMPOSAplicações de estatística multivariada em análise de dados experimentaisinfo:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/publishedVersion-4297417259498638931600600-2246940120554092110reponame:Repositório Institucional da Universidade Federal de Alfenas - RiUnifalinstname:Universidade Federal de Alfenas (UNIFAL)instacron:UNIFALZanardo, Bruno FelipeLICENSElicense.txtlicense.txttext/plain; charset=utf-81987https://repositorio.unifal-mg.edu.br/bitstreams/886b802e-4b7b-466b-a5a0-ac66f29ed81c/download31555718c4fc75849dd08f27935d4f6bMD51ORIGINALDissertacao_BrunoFelipeZanardo_2023_PPGF.pdfDissertacao_BrunoFelipeZanardo_2023_PPGF.pdfapplication/pdf2846015https://repositorio.unifal-mg.edu.br/bitstreams/8706fc6b-e71c-4fe5-a9e9-f15f37f78508/download6b74c3f5d25ed14ab8739de6d4c5a152MD52TEXTDissertacao_BrunoFelipeZanardo_2023_PPGF.pdf.txtDissertacao_BrunoFelipeZanardo_2023_PPGF.pdf.txtExtracted texttext/plain104120https://repositorio.unifal-mg.edu.br/bitstreams/c53e2548-4e8e-4d57-a673-22f213efa9eb/download2ce2ed3a88486fe342865dfa61f20929MD55THUMBNAILDissertacao_BrunoFelipeZanardo_2023_PPGF.pdf.jpgDissertacao_BrunoFelipeZanardo_2023_PPGF.pdf.jpgGenerated Thumbnailimage/jpeg2539https://repositorio.unifal-mg.edu.br/bitstreams/0ff9a04b-6b3b-49d9-b33b-9970fcf2f5e0/downloadced89e1299926006932b42c29e3ba9aaMD54123456789/24202026-01-07 14:44:02.521open.accessoai:repositorio.unifal-mg.edu.br:123456789/2420https://repositorio.unifal-mg.edu.brRepositório InstitucionalPUBhttps://bdtd.unifal-mg.edu.br:8443/oai/requestrepositorio@unifal-mg.edu.bropendoar:2026-01-07T17:44:02Repositório Institucional da Universidade Federal de Alfenas - RiUnifal - Universidade Federal de Alfenas (UNIFAL)falseTElDRU7Dh0EgREUgRElTVFJJQlVJw4fDg08gTsODTy1FWENMVVNJVkEKCkNvbSBhIGFwcmVzZW50YcOnw6NvIGRlc3RhIGxpY2Vuw6dhLCBvIGF1dG9yIG91IG8gdGl0dWxhciBkb3MgZGlyZWl0b3MgZGUgYXV0b3IgY29uY2VkZSDDoCBVbml2ZXJzaWRhZGUgCkZlZGVyYWwgZGUgQWxmZW5hcyAgKFVOSUZBTC1NRykgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsICB0cmFkdXppciAoY29uZm9ybWUgZGVmaW5pZG8gYWJhaXhvKSwgZS9vdSAKZGlzdHJpYnVpciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gKGluY2x1aW5kbyBvIHJlc3VtbykgcG9yIHRvZG8gbyBtdW5kbyBubyBmb3JtYXRvIGltcHJlc3NvIGUgZWxldHLDtG5pY28gZSAKZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBjb25jb3JkYSBxdWUgYSBVTklGQUwtTUcgcG9kZSwgc2VtIGFsdGVyYXIgbyBjb250ZcO6ZG8sIHRyYW5zcG9yIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyAKcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhICBVTklGQUwtTUcgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSB0ZXNlIG91IApkaXNzZXJ0YcOnw6NvIHBhcmEgZmlucyBkZSBzZWd1cmFuw6dhLCBiYWNrLXVwIGUgcHJlc2VydmHDp8Ojby4KClZvY8OqIGRlY2xhcmEgcXVlIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyDDqSBvcmlnaW5hbCBlIHF1ZSB2b2PDqiB0ZW0gbyBwb2RlciBkZSBjb25jZWRlciBvcyBkaXJlaXRvcyBjb250aWRvcyAKbmVzdGEgbGljZW7Dp2EuIFZvY8OqIHRhbWLDqW0gZGVjbGFyYSBxdWUgbyBkZXDDs3NpdG8gZGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBuw6NvLCBxdWUgc2VqYSBkZSBzZXUgCmNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiAKZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc8OjbyBpcnJlc3RyaXRhIGRvIGRldGVudG9yIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBwYXJhIGNvbmNlZGVyIMOgICBVTklGQUwtTUcgCm9zIGRpcmVpdG9zIGFwcmVzZW50YWRvcyBuZXN0YSBsaWNlbsOnYSwgZSBxdWUgZXNzZSBtYXRlcmlhbCBkZSBwcm9wcmllZGFkZSBkZSB0ZXJjZWlyb3MgZXN0w6EgY2xhcmFtZW50ZSAKaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBURVNFIE9VIERJU1NFUlRBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgCkFQT0lPIERFIFVNQSBBR8OKTkNJQSBERSBGT01FTlRPIE9VIE9VVFJPIE9SR0FOSVNNTyBRVUUgTsODTyBTRUpBIEEgIFVOSUZBTC1NRywgClZPQ8OKIERFQ0xBUkEgUVVFIFJFU1BFSVRPVSBUT0RPUyBFIFFVQUlTUVVFUiBESVJFSVRPUyBERSBSRVZJU8ODTyBDT01PIApUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCkEgVU5JRkFMLU1HIHNlIGNvbXByb21ldGUgYSBpZGVudGlmaWNhciBjbGFyYW1lbnRlIG8gc2V1IG5vbWUgKHMpIG91IG8ocykgbm9tZShzKSBkbyhzKSAKZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSB0ZXNlIG91IGRpc3NlcnRhw6fDo28sIGUgbsOjbyBmYXLDoSBxdWFscXVlciBhbHRlcmHDp8OjbywgYWzDqW0gZGFxdWVsYXMgCmNvbmNlZGlkYXMgcG9yIGVzdGEgbGljZW7Dp2EuCg==
dc.title.pt-BR.fl_str_mv Aplicações de estatística multivariada em análise de dados experimentais
title Aplicações de estatística multivariada em análise de dados experimentais
spellingShingle Aplicações de estatística multivariada em análise de dados experimentais
Zanardo, Bruno Felipe
estatística multivariada.
correlação canônica.
componentes principais.
análise socioambiental.
física de altas energias.
meio interestelar.
FISICA::FISICA DAS PARTICULAS ELEMENTARES E CAMPOS
title_short Aplicações de estatística multivariada em análise de dados experimentais
title_full Aplicações de estatística multivariada em análise de dados experimentais
title_fullStr Aplicações de estatística multivariada em análise de dados experimentais
title_full_unstemmed Aplicações de estatística multivariada em análise de dados experimentais
title_sort Aplicações de estatística multivariada em análise de dados experimentais
author Zanardo, Bruno Felipe
author_facet Zanardo, Bruno Felipe
author_role author
dc.contributor.author.fl_str_mv Zanardo, Bruno Felipe
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/4002033080997386
dc.contributor.referee1.fl_str_mv Melo, Iara Tosta E
dc.contributor.referee2.fl_str_mv Valdiviesso, Gustavo Do Amaral
dc.contributor.advisor1.fl_str_mv Melo, Cássius Anderson Miquele De
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/5271644280335080
contributor_str_mv Melo, Iara Tosta E
Valdiviesso, Gustavo Do Amaral
Melo, Cássius Anderson Miquele De
dc.subject.por.fl_str_mv estatística multivariada.
correlação canônica.
componentes principais.
análise socioambiental.
física de altas energias.
meio interestelar.
topic estatística multivariada.
correlação canônica.
componentes principais.
análise socioambiental.
física de altas energias.
meio interestelar.
FISICA::FISICA DAS PARTICULAS ELEMENTARES E CAMPOS
dc.subject.cnpq.fl_str_mv FISICA::FISICA DAS PARTICULAS ELEMENTARES E CAMPOS
description Multivariate statistics is a branch of statistics responsible for studying situations where statistics models presents more then one variable and their methods can be applied in the most diverse areas of knowledge assisting in decision making, because their methods have as main benefits the reduction of dimensionality of the model studied, making it less complex, in addition to being used in the construction of indexes, classification, association between variables and statistical inference. In this work Multivariate statistics methods were applied in tree different situations, the method of canonical correlation analysis was applied in two different situations and the principal component analysis with experimental errors method was applied in the last one. the first application referring to a socio-environmental analysis where the existence of a correlation between the human development index (HDI) and its sub-indices in relation to water consumption and sewage generation of Brazilian cities was analyzed. While the second analysis is related to high-energy physics involving the collision of heavy lead ions Pb-Pb. The third situation refers to the application of principal component analysis for the dimensionality reduction of a model of characterization of the interstellar medium. As a result, it was possible to generate a model capable of correlating the HDI with water consumption and sewage generation, with a canonical correlation of 62.4%, capable of representing the whole country, and a second model, directed only to the state of Sa˜o Paulo, with a canonical correlation of 83%. For the second scenario, involving the collision of heavy ions a canonical correlation of 99.9% was obtained, confirming the existing correlation between entropy and the number of charged particles. A second correlation, calculated from the second pair of canonical variables, returned a high correlation with 96%, however, in this model the crosssectional moment had the highest canonical weight, and can be calculated from the other variables studied such as centrality, energy and entropy. Regarding the principal component analysis, it was possible to reduce the number of variables used in the explanation of the model significantly, from 23 original variables to 8 main components, in addition to identifying that when considering the experimental errors during the analysis we obtain greater security regarding the number of variables used to explain the model.
publishDate 2023
dc.date.issued.fl_str_mv 2023-08-25
dc.date.accessioned.fl_str_mv 2024-07-17T20:13:01Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv ZANARDO, Bruno Felipe. Aplicações de estatística multivariada em análise de dados experimentais. 2024. 100 f. Dissertação (Mestrado em Física) - Universidade Federal de Alfenas, Poços de Caldas, MG, 2023.
dc.identifier.uri.fl_str_mv https://repositorio.unifal-mg.edu.br/handle/123456789/2420
identifier_str_mv ZANARDO, Bruno Felipe. Aplicações de estatística multivariada em análise de dados experimentais. 2024. 100 f. Dissertação (Mestrado em Física) - Universidade Federal de Alfenas, Poços de Caldas, MG, 2023.
url https://repositorio.unifal-mg.edu.br/handle/123456789/2420
dc.language.iso.fl_str_mv por
language por
dc.relation.department.fl_str_mv -4297417259498638931
dc.relation.confidence.fl_str_mv 600
600
dc.relation.cnpq.fl_str_mv -2246940120554092110
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Alfenas
dc.publisher.program.fl_str_mv Programa de Pós-graduação em Física
dc.publisher.initials.fl_str_mv UNIFAL-MG
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv Instituto de Ciência e Tecnologia
publisher.none.fl_str_mv Universidade Federal de Alfenas
dc.source.none.fl_str_mv reponame:Repositório Institucional da Universidade Federal de Alfenas - RiUnifal
instname:Universidade Federal de Alfenas (UNIFAL)
instacron:UNIFAL
instname_str Universidade Federal de Alfenas (UNIFAL)
instacron_str UNIFAL
institution UNIFAL
reponame_str Repositório Institucional da Universidade Federal de Alfenas - RiUnifal
collection Repositório Institucional da Universidade Federal de Alfenas - RiUnifal
bitstream.url.fl_str_mv https://repositorio.unifal-mg.edu.br/bitstreams/886b802e-4b7b-466b-a5a0-ac66f29ed81c/download
https://repositorio.unifal-mg.edu.br/bitstreams/8706fc6b-e71c-4fe5-a9e9-f15f37f78508/download
https://repositorio.unifal-mg.edu.br/bitstreams/c53e2548-4e8e-4d57-a673-22f213efa9eb/download
https://repositorio.unifal-mg.edu.br/bitstreams/0ff9a04b-6b3b-49d9-b33b-9970fcf2f5e0/download
bitstream.checksum.fl_str_mv 31555718c4fc75849dd08f27935d4f6b
6b74c3f5d25ed14ab8739de6d4c5a152
2ce2ed3a88486fe342865dfa61f20929
ced89e1299926006932b42c29e3ba9aa
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da Universidade Federal de Alfenas - RiUnifal - Universidade Federal de Alfenas (UNIFAL)
repository.mail.fl_str_mv repositorio@unifal-mg.edu.br
_version_ 1859830902586081280