Online Large-Scale Hypothesis Tesng with Corrupted Data

Detalhes bibliográficos
Ano de defesa: 2024
Autor(a) principal: Alves, Victor Benicio Ardilha da Silva
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: eng
Instituição de defesa: Naval Postgraduate School (NVS)
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://www.repositorio.mar.mil.br/handle/ripcmb/847104
Resumo: Modern statistical inference involves processing extensive datasets, with multiple hypothesis testing being one methodology to draw conclusions on many features at once. Control of the false discovery rate (FDR) is essential. Target classification via satellite imagery and acoustic signal processing are examples in military applications where false detections can be costly for a Command and Control framework. These contexts also showcase another layer of complexity: the volume of data is often processed online, with decisions having to be made sequentially on evolving, incomplete datasets. This underscores the need for FDR control in an online environment. Current methods for online FDR control are successful in this regard; however, they are not designed with data error or, worse, data corruption in mind. This research will explore the level of robustness of the Levels Based On Recent Discovery (LORD) algorithm. The fundamental objective is to learn how to corrupt data and make it robust against such corruption efficiently. This work will draw insights from studying corruption-robust bandit algorithms and aim to advance the adversarial online multiple-hypothesis testing field.
id MB_6253d958a1c38ce0b34cc57c9ff8788d
oai_identifier_str oai:www.repositorio.mar.mil.br:ripcmb/847104
network_acronym_str MB
network_name_str Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB)
repository_id_str
spelling Online Large-Scale Hypothesis Tesng with Corrupted DataFalse discovery ratePowerData corruponCascading effectEngenharia de produçãoDiretoria-Geral do Material da Marinha (DGMM)Modern statistical inference involves processing extensive datasets, with multiple hypothesis testing being one methodology to draw conclusions on many features at once. Control of the false discovery rate (FDR) is essential. Target classification via satellite imagery and acoustic signal processing are examples in military applications where false detections can be costly for a Command and Control framework. These contexts also showcase another layer of complexity: the volume of data is often processed online, with decisions having to be made sequentially on evolving, incomplete datasets. This underscores the need for FDR control in an online environment. Current methods for online FDR control are successful in this regard; however, they are not designed with data error or, worse, data corruption in mind. This research will explore the level of robustness of the Levels Based On Recent Discovery (LORD) algorithm. The fundamental objective is to learn how to corrupt data and make it robust against such corruption efficiently. This work will draw insights from studying corruption-robust bandit algorithms and aim to advance the adversarial online multiple-hypothesis testing field.Naval Postgraduate School (NVS)Szechtman, RobertoChen, LouisAlves, Victor Benicio Ardilha da Silva2024-09-19T14:20:23Z2024-09-19T14:20:23Z2024info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://www.repositorio.mar.mil.br/handle/ripcmb/847104info:eu-repo/semantics/openAccessengreponame:Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB)instname:Marinha do Brasil (MB)instacron:MB2025-08-26T18:42:24Zoai:www.repositorio.mar.mil.br:ripcmb/847104Repositório InstitucionalPUBhttps://www.repositorio.mar.mil.br/oai/requestdphdm.repositorio@marinha.mil.bropendoar:2025-08-26T18:42:24Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB) - Marinha do Brasil (MB)false
dc.title.none.fl_str_mv Online Large-Scale Hypothesis Tesng with Corrupted Data
title Online Large-Scale Hypothesis Tesng with Corrupted Data
spellingShingle Online Large-Scale Hypothesis Tesng with Corrupted Data
Alves, Victor Benicio Ardilha da Silva
False discovery rate
Power
Data corrupon
Cascading effect
Engenharia de produção
Diretoria-Geral do Material da Marinha (DGMM)
title_short Online Large-Scale Hypothesis Tesng with Corrupted Data
title_full Online Large-Scale Hypothesis Tesng with Corrupted Data
title_fullStr Online Large-Scale Hypothesis Tesng with Corrupted Data
title_full_unstemmed Online Large-Scale Hypothesis Tesng with Corrupted Data
title_sort Online Large-Scale Hypothesis Tesng with Corrupted Data
author Alves, Victor Benicio Ardilha da Silva
author_facet Alves, Victor Benicio Ardilha da Silva
author_role author
dc.contributor.none.fl_str_mv Szechtman, Roberto
Chen, Louis
dc.contributor.author.fl_str_mv Alves, Victor Benicio Ardilha da Silva
dc.subject.por.fl_str_mv False discovery rate
Power
Data corrupon
Cascading effect
Engenharia de produção
Diretoria-Geral do Material da Marinha (DGMM)
topic False discovery rate
Power
Data corrupon
Cascading effect
Engenharia de produção
Diretoria-Geral do Material da Marinha (DGMM)
description Modern statistical inference involves processing extensive datasets, with multiple hypothesis testing being one methodology to draw conclusions on many features at once. Control of the false discovery rate (FDR) is essential. Target classification via satellite imagery and acoustic signal processing are examples in military applications where false detections can be costly for a Command and Control framework. These contexts also showcase another layer of complexity: the volume of data is often processed online, with decisions having to be made sequentially on evolving, incomplete datasets. This underscores the need for FDR control in an online environment. Current methods for online FDR control are successful in this regard; however, they are not designed with data error or, worse, data corruption in mind. This research will explore the level of robustness of the Levels Based On Recent Discovery (LORD) algorithm. The fundamental objective is to learn how to corrupt data and make it robust against such corruption efficiently. This work will draw insights from studying corruption-robust bandit algorithms and aim to advance the adversarial online multiple-hypothesis testing field.
publishDate 2024
dc.date.none.fl_str_mv 2024-09-19T14:20:23Z
2024-09-19T14:20:23Z
2024
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.repositorio.mar.mil.br/handle/ripcmb/847104
url https://www.repositorio.mar.mil.br/handle/ripcmb/847104
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Naval Postgraduate School (NVS)
publisher.none.fl_str_mv Naval Postgraduate School (NVS)
dc.source.none.fl_str_mv reponame:Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB)
instname:Marinha do Brasil (MB)
instacron:MB
instname_str Marinha do Brasil (MB)
instacron_str MB
institution MB
reponame_str Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB)
collection Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB)
repository.name.fl_str_mv Repositório Institucional da Produção Científica da Marinha do Brasil (RI-MB) - Marinha do Brasil (MB)
repository.mail.fl_str_mv dphdm.repositorio@marinha.mil.br
_version_ 1855762816019988480