Small and time-efficient distribution-free predictive regions

Detalhes bibliográficos
Ano de defesa: 2023
Autor(a) principal: Reis, Victor Candido
Orientador(a): Izbicki, Rafael lattes
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: eng
Instituição de defesa: Universidade Federal de São Carlos
Câmpus São Carlos
Programa de Pós-Graduação: Programa Interinstitucional de Pós-Graduação em Estatística - PIPGEs
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Inglês:
Área do conhecimento CNPq:
Link de acesso: https://repositorio.ufscar.br/handle/20.500.14289/18077
Resumo: Predicting a target variable (response) is often the main objective of many studies and investigations. In such scenarios, there are usually other variables, known as covariates, that are more readily available and can assist in the prediction process. Regression and classification methods aim to utilize the statistical associations between all available information to model the variable of interest. During such modeling, there is a significant emphasis on estimating regions that describe the fluctuations of the response, allowing for the quantification of the uncertainty of point estimates. Conformal prediction methods (VOVK; GAMMERMAN; SHAFER, 2005) are a class of methods that aim to provide regions with general shapes and high probability guarantees, assuming only exchangeability, which is a weaker assumption than independent and identically distributed data. This allows for extensive use in various applications. New methodologies have been developed to improve the theoretical properties and applicability of the original ideas, with a practical perspective on execution and computational cost. Motivated by this context, this work aims to enrich the class of conformal prediction methods, with a particular focus on regression problems and proposes a new method that better utilizes available information, provides greater generality in the format of the regions, and is more efficient in terms of computational cost. The proposed method was compared with previous works using simulation studies, and it achieved competitive results.
id SCAR_6c77e6faacae5ea2b526de065bb791e1
oai_identifier_str oai:repositorio.ufscar.br:20.500.14289/18077
network_acronym_str SCAR
network_name_str Repositório Institucional da UFSCAR
repository_id_str
spelling Reis, Victor CandidoIzbicki, Rafaelhttp://lattes.cnpq.br/9991192137633896http://lattes.cnpq.br/2436861079295576ca595078-8fc4-4394-8b3a-d81e9fe4e53e2023-05-26T18:24:49Z2023-05-26T18:24:49Z2023-05-02REIS, Victor Candido. Small and time-efficient distribution-free predictive regions. 2023. Dissertação (Mestrado em Estatística) – Universidade Federal de São Carlos, São Carlos, 2023. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/18077.https://repositorio.ufscar.br/handle/20.500.14289/18077Predicting a target variable (response) is often the main objective of many studies and investigations. In such scenarios, there are usually other variables, known as covariates, that are more readily available and can assist in the prediction process. Regression and classification methods aim to utilize the statistical associations between all available information to model the variable of interest. During such modeling, there is a significant emphasis on estimating regions that describe the fluctuations of the response, allowing for the quantification of the uncertainty of point estimates. Conformal prediction methods (VOVK; GAMMERMAN; SHAFER, 2005) are a class of methods that aim to provide regions with general shapes and high probability guarantees, assuming only exchangeability, which is a weaker assumption than independent and identically distributed data. This allows for extensive use in various applications. New methodologies have been developed to improve the theoretical properties and applicability of the original ideas, with a practical perspective on execution and computational cost. Motivated by this context, this work aims to enrich the class of conformal prediction methods, with a particular focus on regression problems and proposes a new method that better utilizes available information, provides greater generality in the format of the regions, and is more efficient in terms of computational cost. The proposed method was compared with previous works using simulation studies, and it achieved competitive results.Frequentemente, prever uma variável alvo (resposta) é objeto de interesse de investigações e estudos. Nesse cenário, é comum existirem variáveis mais acessíveis (covariáveis) que podem ajudar no processo de previsão. Métodos de regressão e classificação surgem então com o objetivo de usar as associações estatísticas entre todas as informações disponíveis para modelar a variável de interesse. Há um grande foco, durante tal modelagem, em estimar regiões que descrevam a flutuação da resposta, possibilitando, por exemplo, quantificar a incerteza de estimativas pontuais. Conformal prediction é uma classe de métodos derivada de Vovk, Gammerman and Shafer (2005) que busca fornecer regiões com formas gerais e garantia de alta probabilidade, assumindo, basicamente, apenas permutabilidade das observações, suposição mais fraca do que dados independentes e identicamente distribuídos, o que permite seu uso extensivo. Novas metodologias têm sido desenvolvidas para aprimorar as propriedades teóricas dessa classe, bem como a aplicabilidade das ideias originais do ponto de vista prático de execução e custo computacional. Este trabalho objetivou enriquecer a classe de Conformal prediction com foco em problemas de regressão, propondo uma nova abordagem que reúne um melhor aproveitamento dos dados com uma maior generalidade no formato das regiões, em uma perspectiva de custo computacional mais eficiente. Resultados competitivos foram encontrados ao comparar o método proposto com trabalhos anteriores via estudos de simulação.Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)88887.634340/2021-00engUniversidade Federal de São CarlosCâmpus São CarlosPrograma Interinstitucional de Pós-Graduação em Estatística - PIPGEsUFSCarAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessConformal predictionCIENCIAS EXATAS E DA TERRA::PROBABILIDADE E ESTATISTICA::ESTATISTICA::INFERENCIA NAO-PARAMETRICASmall and time-efficient distribution-free predictive regionsRegiões preditivas flexíveis, eficientes e livres-de-suposiçãoinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesis6006003e57f161-19fe-4345-9e87-bc60eb7be98freponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINAL20230526_victor_dissertação_mestrado_conformal_revisada_ufscar.pdf20230526_victor_dissertação_mestrado_conformal_revisada_ufscar.pdfapplication/pdf480344https://repositorio.ufscar.br/bitstreams/d5ff2b11-6bc9-432f-8153-03fd567a182b/download98072d86e4857b84da23794976ec909cMD51trueAnonymousREADCC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8810https://repositorio.ufscar.br/bitstreams/0bce0810-538e-4457-9357-845a545d651b/downloadf337d95da1fce0a22c77480e5e9a7aecMD52falseAnonymousREADTEXT20230526_victor_dissertação_mestrado_conformal_revisada_ufscar.pdf.txt20230526_victor_dissertação_mestrado_conformal_revisada_ufscar.pdf.txtExtracted texttext/plain45872https://repositorio.ufscar.br/bitstreams/18b48d6c-afc1-4531-a92c-55cbdf18f8e9/download74e5be5d34238c08db6443f24fefb174MD53falseAnonymousREADTHUMBNAIL20230526_victor_dissertação_mestrado_conformal_revisada_ufscar.pdf.jpg20230526_victor_dissertação_mestrado_conformal_revisada_ufscar.pdf.jpgIM Thumbnailimage/jpeg15158https://repositorio.ufscar.br/bitstreams/29e4710c-db8c-41eb-8b8f-b5edc2fe4e3a/downloadae4fea3cdc66ce86cb217e266074e4afMD54falseAnonymousREAD20.500.14289/180772025-02-05 23:45:37.151http://creativecommons.org/licenses/by-nc-nd/3.0/br/Attribution-NonCommercial-NoDerivs 3.0 Brazilopen.accessoai:repositorio.ufscar.br:20.500.14289/18077https://repositorio.ufscar.brRepositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestrepositorio.sibi@ufscar.bropendoar:43222025-02-06T02:45:37Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false
dc.title.eng.fl_str_mv Small and time-efficient distribution-free predictive regions
dc.title.alternative.por.fl_str_mv Regiões preditivas flexíveis, eficientes e livres-de-suposição
title Small and time-efficient distribution-free predictive regions
spellingShingle Small and time-efficient distribution-free predictive regions
Reis, Victor Candido
Conformal prediction
CIENCIAS EXATAS E DA TERRA::PROBABILIDADE E ESTATISTICA::ESTATISTICA::INFERENCIA NAO-PARAMETRICA
title_short Small and time-efficient distribution-free predictive regions
title_full Small and time-efficient distribution-free predictive regions
title_fullStr Small and time-efficient distribution-free predictive regions
title_full_unstemmed Small and time-efficient distribution-free predictive regions
title_sort Small and time-efficient distribution-free predictive regions
author Reis, Victor Candido
author_facet Reis, Victor Candido
author_role author
dc.contributor.authorlattes.por.fl_str_mv http://lattes.cnpq.br/2436861079295576
dc.contributor.author.fl_str_mv Reis, Victor Candido
dc.contributor.advisor1.fl_str_mv Izbicki, Rafael
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/9991192137633896
dc.contributor.authorID.fl_str_mv ca595078-8fc4-4394-8b3a-d81e9fe4e53e
contributor_str_mv Izbicki, Rafael
dc.subject.eng.fl_str_mv Conformal prediction
topic Conformal prediction
CIENCIAS EXATAS E DA TERRA::PROBABILIDADE E ESTATISTICA::ESTATISTICA::INFERENCIA NAO-PARAMETRICA
dc.subject.cnpq.fl_str_mv CIENCIAS EXATAS E DA TERRA::PROBABILIDADE E ESTATISTICA::ESTATISTICA::INFERENCIA NAO-PARAMETRICA
description Predicting a target variable (response) is often the main objective of many studies and investigations. In such scenarios, there are usually other variables, known as covariates, that are more readily available and can assist in the prediction process. Regression and classification methods aim to utilize the statistical associations between all available information to model the variable of interest. During such modeling, there is a significant emphasis on estimating regions that describe the fluctuations of the response, allowing for the quantification of the uncertainty of point estimates. Conformal prediction methods (VOVK; GAMMERMAN; SHAFER, 2005) are a class of methods that aim to provide regions with general shapes and high probability guarantees, assuming only exchangeability, which is a weaker assumption than independent and identically distributed data. This allows for extensive use in various applications. New methodologies have been developed to improve the theoretical properties and applicability of the original ideas, with a practical perspective on execution and computational cost. Motivated by this context, this work aims to enrich the class of conformal prediction methods, with a particular focus on regression problems and proposes a new method that better utilizes available information, provides greater generality in the format of the regions, and is more efficient in terms of computational cost. The proposed method was compared with previous works using simulation studies, and it achieved competitive results.
publishDate 2023
dc.date.accessioned.fl_str_mv 2023-05-26T18:24:49Z
dc.date.available.fl_str_mv 2023-05-26T18:24:49Z
dc.date.issued.fl_str_mv 2023-05-02
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv REIS, Victor Candido. Small and time-efficient distribution-free predictive regions. 2023. Dissertação (Mestrado em Estatística) – Universidade Federal de São Carlos, São Carlos, 2023. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/18077.
dc.identifier.uri.fl_str_mv https://repositorio.ufscar.br/handle/20.500.14289/18077
identifier_str_mv REIS, Victor Candido. Small and time-efficient distribution-free predictive regions. 2023. Dissertação (Mestrado em Estatística) – Universidade Federal de São Carlos, São Carlos, 2023. Disponível em: https://repositorio.ufscar.br/handle/20.500.14289/18077.
url https://repositorio.ufscar.br/handle/20.500.14289/18077
dc.language.iso.fl_str_mv eng
language eng
dc.relation.confidence.fl_str_mv 600
600
dc.relation.authority.fl_str_mv 3e57f161-19fe-4345-9e87-bc60eb7be98f
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de São Carlos
Câmpus São Carlos
dc.publisher.program.fl_str_mv Programa Interinstitucional de Pós-Graduação em Estatística - PIPGEs
dc.publisher.initials.fl_str_mv UFSCar
publisher.none.fl_str_mv Universidade Federal de São Carlos
Câmpus São Carlos
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFSCAR
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
instname_str Universidade Federal de São Carlos (UFSCAR)
instacron_str UFSCAR
institution UFSCAR
reponame_str Repositório Institucional da UFSCAR
collection Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv https://repositorio.ufscar.br/bitstreams/d5ff2b11-6bc9-432f-8153-03fd567a182b/download
https://repositorio.ufscar.br/bitstreams/0bce0810-538e-4457-9357-845a545d651b/download
https://repositorio.ufscar.br/bitstreams/18b48d6c-afc1-4531-a92c-55cbdf18f8e9/download
https://repositorio.ufscar.br/bitstreams/29e4710c-db8c-41eb-8b8f-b5edc2fe4e3a/download
bitstream.checksum.fl_str_mv 98072d86e4857b84da23794976ec909c
f337d95da1fce0a22c77480e5e9a7aec
74e5be5d34238c08db6443f24fefb174
ae4fea3cdc66ce86cb217e266074e4af
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv repositorio.sibi@ufscar.br
_version_ 1851688909823541248