Analysis of natural disasters in data from news
| Ano de defesa: | 2024 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Tese |
| Tipo de acesso: | Acesso aberto |
| dARK ID: | ark:/48912/001300002ssws |
| Idioma: | eng |
| Instituição de defesa: |
Universidade Federal de São Paulo
|
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | https://hdl.handle.net/11600/72677 |
Resumo: | Natural disasters have been occurring with increasing frequency as a result of human activity on the environment, causing significant damage to society. Minimizing these losses depends on the development of protection policies, which need to be supported by accurate information about the events. However, collecting information on disasters presents several challenges, such as insufficient manpower to document every detail of the event and the unpredictability of the events, making it difficult to capture the initial moments after a disaster. In light of these challenges, this work developed methodologies to utilize news data as an alternative source of information on disasters. Specifically, techniques for document filtering, event detection, and automatic summarization were proposed and optimized to achieve better results in this domain, with a particular focus on improving applications in Portuguese, as there is a shortage of research in this language. The main contributions of this work are: 1) a complete framework for building knowledge bases from news articles, 2) new Portuguese datasets for several Natural Language Processing (NLP) tasks, 3) a novel method to produce more accurate summaries based on siamese networks, 4) an evaluation of the latest text classification techniques for application in Portuguese, and 5) a systematic literature review on event detection in news. This work provides contributions to various NLP tasks, with a special emphasis on addressing and developing solutions for the Portuguese language. |
| id |
UFSP_9e938335fbe41b9acafbc9c9ae6036e3 |
|---|---|
| oai_identifier_str |
oai:repositorio.unifesp.br:11600/72677 |
| network_acronym_str |
UFSP |
| network_name_str |
Repositório Institucional da UNIFESP |
| repository_id_str |
|
| spelling |
Analysis of natural disasters in data from newsAnálise de desastres naturais em dados de notíciasNatural Language ProcessingAutomatic Text SummarizationEvent DetectionAutomatic Text ClassificationMachine LearningNatural disasters have been occurring with increasing frequency as a result of human activity on the environment, causing significant damage to society. Minimizing these losses depends on the development of protection policies, which need to be supported by accurate information about the events. However, collecting information on disasters presents several challenges, such as insufficient manpower to document every detail of the event and the unpredictability of the events, making it difficult to capture the initial moments after a disaster. In light of these challenges, this work developed methodologies to utilize news data as an alternative source of information on disasters. Specifically, techniques for document filtering, event detection, and automatic summarization were proposed and optimized to achieve better results in this domain, with a particular focus on improving applications in Portuguese, as there is a shortage of research in this language. The main contributions of this work are: 1) a complete framework for building knowledge bases from news articles, 2) new Portuguese datasets for several Natural Language Processing (NLP) tasks, 3) a novel method to produce more accurate summaries based on siamese networks, 4) an evaluation of the latest text classification techniques for application in Portuguese, and 5) a systematic literature review on event detection in news. This work provides contributions to various NLP tasks, with a special emphasis on addressing and developing solutions for the Portuguese language.Universidade Federal de São PauloBerton, Lilian [UNIFESP]http://lattes.cnpq.br/9064767888093340http://lattes.cnpq.br/0896350174589757Garcia, Klaifer [UNIFESP]2024-12-30T13:24:58Z2024-12-30T13:24:58Z2024-11-25info:eu-repo/semantics/doctoralThesisinfo:eu-repo/semantics/publishedVersion149 f.application/pdfhttps://hdl.handle.net/11600/72677ark:/48912/001300002sswsengSão José dos Campos, SPinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UNIFESPinstname:Universidade Federal de São Paulo (UNIFESP)instacron:UNIFESP2024-12-31T04:01:32Zoai:repositorio.unifesp.br:11600/72677Repositório InstitucionalPUBhttp://www.repositorio.unifesp.br/oai/requestbiblioteca.csp@unifesp.bropendoar:34652024-12-31T04:01:32Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP)false |
| dc.title.none.fl_str_mv |
Analysis of natural disasters in data from news Análise de desastres naturais em dados de notícias |
| title |
Analysis of natural disasters in data from news |
| spellingShingle |
Analysis of natural disasters in data from news Garcia, Klaifer [UNIFESP] Natural Language Processing Automatic Text Summarization Event Detection Automatic Text Classification Machine Learning |
| title_short |
Analysis of natural disasters in data from news |
| title_full |
Analysis of natural disasters in data from news |
| title_fullStr |
Analysis of natural disasters in data from news |
| title_full_unstemmed |
Analysis of natural disasters in data from news |
| title_sort |
Analysis of natural disasters in data from news |
| author |
Garcia, Klaifer [UNIFESP] |
| author_facet |
Garcia, Klaifer [UNIFESP] |
| author_role |
author |
| dc.contributor.none.fl_str_mv |
Berton, Lilian [UNIFESP] http://lattes.cnpq.br/9064767888093340 http://lattes.cnpq.br/0896350174589757 |
| dc.contributor.author.fl_str_mv |
Garcia, Klaifer [UNIFESP] |
| dc.subject.por.fl_str_mv |
Natural Language Processing Automatic Text Summarization Event Detection Automatic Text Classification Machine Learning |
| topic |
Natural Language Processing Automatic Text Summarization Event Detection Automatic Text Classification Machine Learning |
| description |
Natural disasters have been occurring with increasing frequency as a result of human activity on the environment, causing significant damage to society. Minimizing these losses depends on the development of protection policies, which need to be supported by accurate information about the events. However, collecting information on disasters presents several challenges, such as insufficient manpower to document every detail of the event and the unpredictability of the events, making it difficult to capture the initial moments after a disaster. In light of these challenges, this work developed methodologies to utilize news data as an alternative source of information on disasters. Specifically, techniques for document filtering, event detection, and automatic summarization were proposed and optimized to achieve better results in this domain, with a particular focus on improving applications in Portuguese, as there is a shortage of research in this language. The main contributions of this work are: 1) a complete framework for building knowledge bases from news articles, 2) new Portuguese datasets for several Natural Language Processing (NLP) tasks, 3) a novel method to produce more accurate summaries based on siamese networks, 4) an evaluation of the latest text classification techniques for application in Portuguese, and 5) a systematic literature review on event detection in news. This work provides contributions to various NLP tasks, with a special emphasis on addressing and developing solutions for the Portuguese language. |
| publishDate |
2024 |
| dc.date.none.fl_str_mv |
2024-12-30T13:24:58Z 2024-12-30T13:24:58Z 2024-11-25 |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| format |
doctoralThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/11600/72677 |
| dc.identifier.dark.fl_str_mv |
ark:/48912/001300002ssws |
| url |
https://hdl.handle.net/11600/72677 |
| identifier_str_mv |
ark:/48912/001300002ssws |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
149 f. application/pdf |
| dc.coverage.none.fl_str_mv |
São José dos Campos, SP |
| dc.publisher.none.fl_str_mv |
Universidade Federal de São Paulo |
| publisher.none.fl_str_mv |
Universidade Federal de São Paulo |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UNIFESP instname:Universidade Federal de São Paulo (UNIFESP) instacron:UNIFESP |
| instname_str |
Universidade Federal de São Paulo (UNIFESP) |
| instacron_str |
UNIFESP |
| institution |
UNIFESP |
| reponame_str |
Repositório Institucional da UNIFESP |
| collection |
Repositório Institucional da UNIFESP |
| repository.name.fl_str_mv |
Repositório Institucional da UNIFESP - Universidade Federal de São Paulo (UNIFESP) |
| repository.mail.fl_str_mv |
biblioteca.csp@unifesp.br |
| _version_ |
1848498050461335552 |