Analysis of natural disasters in data from news

Bibliographic details
Year of defense: 2024
Main author: Garcia, Klaifer [UNIFESP]
Advisor: Not provided by the institution
Defense committee: Not provided by the institution
Document type: Thesis
Access type: Open access
dARK ID: ark:/48912/001300002ssws
Language: eng
Defending institution: Universidade Federal de São Paulo
Graduate program: Not provided by the institution
Department: Not provided by the institution
Country: Not provided by the institution
Keywords in Portuguese:
Access link: https://hdl.handle.net/11600/72677
Abstract: Natural disasters are occurring with increasing frequency as a result of human activity on the environment, causing significant damage to society. Minimizing these losses depends on protection policies, which must be supported by accurate information about the events. However, collecting information on disasters is challenging: there is rarely enough manpower to document every detail of an event, and the unpredictability of disasters makes it difficult to capture the first moments after one occurs. In light of these challenges, this work developed methodologies for using news data as an alternative source of information on disasters. Specifically, techniques for document filtering, event detection, and automatic summarization were proposed and optimized for this domain, with a particular focus on applications in Portuguese, a language for which research is scarce. The main contributions of this work are: 1) a complete framework for building knowledge bases from news articles, 2) new Portuguese datasets for several Natural Language Processing (NLP) tasks, 3) a novel method, based on siamese networks, for producing more accurate summaries, 4) an evaluation of the latest text classification techniques applied to Portuguese, and 5) a systematic literature review on event detection in news. Together, these contributions span several NLP tasks, with special emphasis on solutions for the Portuguese language.
id UFSP_9e938335fbe41b9acafbc9c9ae6036e3
oai_identifier_str oai:repositorio.unifesp.br:11600/72677
network_acronym_str UFSP
network_name_str Repositório Institucional da UNIFESP
repository_id_str
spelling Analysis of natural disasters in data from news
Análise de desastres naturais em dados de notícias
Natural Language Processing
Automatic Text Summarization
Event Detection
Automatic Text Classification
Machine Learning
Universidade Federal de São Paulo
Berton, Lilian [UNIFESP]
http://lattes.cnpq.br/9064767888093340
http://lattes.cnpq.br/0896350174589757
Garcia, Klaifer [UNIFESP]
2024-12-30T13:24:58Z
2024-12-30T13:24:58Z
2024-11-25
info:eu-repo/semantics/doctoralThesis
info:eu-repo/semantics/publishedVersion
149 f.
application/pdf
https://hdl.handle.net/11600/72677
ark:/48912/001300002ssws
eng
São José dos Campos, SP
info:eu-repo/semantics/openAccess
reponame:Repositório Institucional da UNIFESP
instname:Universidade Federal de São Paulo (UNIFESP)
instacron:UNIFESP
2024-12-31T04:01:32Z
oai:repositorio.unifesp.br:11600/72677
Repositório Institucional
PUB
http://www.repositorio.unifesp.br/oai/request
biblioteca.csp@unifesp.br
opendoar:3465
2024-12-31T04:01:32
Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)
false
dc.title.none.fl_str_mv Analysis of natural disasters in data from news
Análise de desastres naturais em dados de notícias
title Analysis of natural disasters in data from news
spellingShingle Analysis of natural disasters in data from news
Garcia, Klaifer [UNIFESP]
Natural Language Processing
Automatic Text Summarization
Event Detection
Automatic Text Classification
Machine Learning
title_short Analysis of natural disasters in data from news
title_full Analysis of natural disasters in data from news
title_fullStr Analysis of natural disasters in data from news
title_full_unstemmed Analysis of natural disasters in data from news
title_sort Analysis of natural disasters in data from news
author Garcia, Klaifer [UNIFESP]
author_facet Garcia, Klaifer [UNIFESP]
author_role author
dc.contributor.none.fl_str_mv Berton, Lilian [UNIFESP]
http://lattes.cnpq.br/9064767888093340
http://lattes.cnpq.br/0896350174589757
dc.contributor.author.fl_str_mv Garcia, Klaifer [UNIFESP]
dc.subject.por.fl_str_mv Natural Language Processing
Automatic Text Summarization
Event Detection
Automatic Text Classification
Machine Learning
topic Natural Language Processing
Automatic Text Summarization
Event Detection
Automatic Text Classification
Machine Learning
publishDate 2024
dc.date.none.fl_str_mv 2024-12-30T13:24:58Z
2024-12-30T13:24:58Z
2024-11-25
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/11600/72677
dc.identifier.dark.fl_str_mv ark:/48912/001300002ssws
url https://hdl.handle.net/11600/72677
identifier_str_mv ark:/48912/001300002ssws
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 149 f.
application/pdf
dc.coverage.none.fl_str_mv São José dos Campos, SP
dc.publisher.none.fl_str_mv Universidade Federal de São Paulo
publisher.none.fl_str_mv Universidade Federal de São Paulo
dc.source.none.fl_str_mv reponame:Repositório Institucional da UNIFESP
instname:Universidade Federal de São Paulo (UNIFESP)
instacron:UNIFESP
instname_str Universidade Federal de São Paulo (UNIFESP)
instacron_str UNIFESP
institution UNIFESP
reponame_str Repositório Institucional da UNIFESP
collection Repositório Institucional da UNIFESP
repository.name.fl_str_mv Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)
repository.mail.fl_str_mv biblioteca.csp@unifesp.br
_version_ 1848498050461335552