Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry
| Ano de defesa: | 2022 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Tese |
| Tipo de acesso: | Acesso aberto |
| Idioma: | eng |
| Instituição de defesa: |
Universidade Federal de Pernambuco
|
| Programa de Pós-Graduação: |
Programa de Pos Graduacao em Engenharia de Producao
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Brasil
|
| Palavras-chave em Português: | |
| Link de acesso: | https://repositorio.ufpe.br/handle/123456789/48492 |
Resumo: | Risk Analysis (RA) is crucial to prevent and mitigate potential risk events; however, there are several challenges related to RA. For instance, accident investigation reports are useful sources of information to support safety professionals to propose measures to prevent or mitigate identified occupational accident root causes. Nevertheless, reports’ low quality and lack of detail may limit their usefulness. Moreover, the quality of Quantitative Risk Analysis (QRA) strongly relies on the identification of all potential hazards with major consequences related to the operation of an industrial system, which is usually performed by multiple experts and consumes a considerable amount of time and effort. Since valuable knowledge about an industrial system is stored in the form of textual data, Natural Language Processing (NLP) techniques can be helpful since it can be applied to extract, organize, and classify information from text. Although several studies contributed to the advance of RA, most studies applying NLP focus primarily on automatically identifying patterns from reactive data, such as accident reports, and do not consider the quality of information contained in these documents. In addition, different forms of text data store relevant knowledge about industrial systems and their respective risks, especially proactive data such as documents resulting from preliminary risk studies, and adoption of these data could support preventive risk studies. The main purpose of this study is to develop NLP-based solutions to different issues faced in RA. Thus, this thesis presents two methodologies to (i) identify issues in a hydropower company’s accident investigation reports that may compromise their usefulness as a decision support tool (ii) automatically identify risk features from documents to support the initial stage of QRA in Oil and Gas (O&G) industries. Occupational safety technicians can benefit from the methodology that helps to identify issues and propose improvements to the accident reports. In addition, the second methodology can help experts to identify and assess hypothetical accidental scenarios related to the operation of an industrial facility. Thus, this thesis may contribute to the prevention and mitigation of occupational and/or major accidents and consequently avoid/reduce property damage, economic and social disruption, environmental degradation, and human losses. |
| id |
UFPE_bfa2fdf1c936bd90e46ae78a08338d71 |
|---|---|
| oai_identifier_str |
oai:repositorio.ufpe.br:123456789/48492 |
| network_acronym_str |
UFPE |
| network_name_str |
Repositório Institucional da UFPE |
| repository_id_str |
|
| spelling |
MACÊDO, July Biashttp://lattes.cnpq.br/2540702750653143http://lattes.cnpq.br/7778828466828647MOURA, Márcio José das ChagasZIO, Enrico2023-01-03T13:19:49Z2023-01-03T13:19:49Z2022-12-20MACÊDO, July Bias. Development of natural language processing-based solutions for risk analysis: application to a hydropower company and an O&G industry. 2022. Tese (Doutorado em Engenharia de Produção) – Universidade Federal de Pernambuco, Recife, 2022.https://repositorio.ufpe.br/handle/123456789/48492Risk Analysis (RA) is crucial to prevent and mitigate potential risk events; however, there are several challenges related to RA. For instance, accident investigation reports are useful sources of information to support safety professionals to propose measures to prevent or mitigate identified occupational accident root causes. Nevertheless, reports’ low quality and lack of detail may limit their usefulness. Moreover, the quality of Quantitative Risk Analysis (QRA) strongly relies on the identification of all potential hazards with major consequences related to the operation of an industrial system, which is usually performed by multiple experts and consumes a considerable amount of time and effort. Since valuable knowledge about an industrial system is stored in the form of textual data, Natural Language Processing (NLP) techniques can be helpful since it can be applied to extract, organize, and classify information from text. Although several studies contributed to the advance of RA, most studies applying NLP focus primarily on automatically identifying patterns from reactive data, such as accident reports, and do not consider the quality of information contained in these documents. In addition, different forms of text data store relevant knowledge about industrial systems and their respective risks, especially proactive data such as documents resulting from preliminary risk studies, and adoption of these data could support preventive risk studies. The main purpose of this study is to develop NLP-based solutions to different issues faced in RA. Thus, this thesis presents two methodologies to (i) identify issues in a hydropower company’s accident investigation reports that may compromise their usefulness as a decision support tool (ii) automatically identify risk features from documents to support the initial stage of QRA in Oil and Gas (O&G) industries. Occupational safety technicians can benefit from the methodology that helps to identify issues and propose improvements to the accident reports. In addition, the second methodology can help experts to identify and assess hypothetical accidental scenarios related to the operation of an industrial facility. Thus, this thesis may contribute to the prevention and mitigation of occupational and/or major accidents and consequently avoid/reduce property damage, economic and social disruption, environmental degradation, and human losses.CAPESFACEPECNPqA Análise de Riscos (RA) é essencial para a prevenção e mitigação de potenciais eventos de risco, porém há vários desafios relacionados à execução da análise. Por exemplo, relatórios de acidentes, são fontes úteis de informação para apoiar os especialistas de segurança a propor medidas preventivas/mitigativas das causas acidentais ocupacionais identificadas. Porém, a falta de detalhes e a baixa qualidade dos relatórios podem limitar a sua utilidade. Além disso, a qualidade da Análise Quantitativa de Risco (QRA) depende fortemente da identificação de todos os potenciais perigos com consequências graves, relacionados à operação do sistema industrial, o que consome uma quantidade considerável de tempo e esforço. Nesse contexto, o Processamento de Linguagem Natural (NLP) pode ser útil pois pode ser aplicado para extrair, organizar e classificar a informação do texto. Embora vários estudos tenham contribuído para o avanço da RA, a maior parte dos estudos que aplicam NLP à RA foca principalmente na identificação automática de padrões a partir de dados reativos, tais como relatórios de acidentes, e não consideram a qualidade da informação contida nestes documentos. Além disso, diferentes formas de dados de texto armazenam conhecimento relevante sobre os sistemas industriais e seus respectivos riscos, especialmente dados proativos, como documentos resultantes de estudos preliminares de riscos, e a adoção desses dados poderia apoiar estudos de risco preventivos. Por isso, esta tese apresenta duas metodologias baseadas em NLP para (i) identificar problemas em relatórios de acidentes que possam comprometer a utilidade desses documentos como ferramenta de suporte a decisão e (ii) para identificar características de risco a partir de documentos para apoiar a fase inicial da QRA. A primeira metodologia dá suporte aos técnicos de segurança para identificar problemas e propor melhorias/correções nos relatórios de acidente, contribuindo para uma melhor gestão de acidentes ocupacionais. Além disso a segunda metodologia pode auxiliar especialistas a identificar e avaliar cenários acidentais relacionados a operação de um sistema industrial. Dessa forma essa tese contribui para a prevenção e mitigação de acidentes e consequentemente evita/reduz danos a propriedade, econômicos e sociais, degradação ambiental e perdas humanas.engUniversidade Federal de PernambucoPrograma de Pos Graduacao em Engenharia de ProducaoUFPEBrasilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessEngenharia de produçãoAnálise de riscosRelatório de acidentesProcessamento de linguagem naturalMineração de textoRefinaria de petróleoCompanhia hidroelétricaDevelopment of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industryinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisdoutoradoreponame:Repositório Institucional da UFPEinstname:Universidade Federal de Pernambuco (UFPE)instacron:UFPECC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufpe.br/bitstream/123456789/48492/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52ORIGINALTESE July Bias Macêdo.pdfTESE July Bias Macêdo.pdfapplication/pdf2997999https://repositorio.ufpe.br/bitstream/123456789/48492/1/TESE%20July%20Bias%20Mac%c3%aado.pdf9f6996ad5784a31c1f8093a9a61ce539MD51LICENSElicense.txtlicense.txttext/plain; charset=utf-82362https://repositorio.ufpe.br/bitstream/123456789/48492/3/license.txt5e89a1613ddc8510c6576f4b23a78973MD53TEXTTESE July Bias Macêdo.pdf.txtTESE July Bias Macêdo.pdf.txtExtracted texttext/plain283790https://repositorio.ufpe.br/bitstream/123456789/48492/4/TESE%20July%20Bias%20Mac%c3%aado.pdf.txt499a39497f5b6f0cdda0e0f4055a5a1dMD54THUMBNAILTESE July Bias Macêdo.pdf.jpgTESE July Bias Macêdo.pdf.jpgGenerated Thumbnailimage/jpeg1244https://repositorio.ufpe.br/bitstream/123456789/48492/5/TESE%20July%20Bias%20Mac%c3%aado.pdf.jpgb9070f8e065cb7aec44fb42c632d02beMD55123456789/484922023-01-04 02:22:51.163oai:repositorio.ufpe.br:123456789/48492VGVybW8gZGUgRGVww7NzaXRvIExlZ2FsIGUgQXV0b3JpemHDp8OjbyBwYXJhIFB1YmxpY2l6YcOnw6NvIGRlIERvY3VtZW50b3Mgbm8gUmVwb3NpdMOzcmlvIERpZ2l0YWwgZGEgVUZQRQoKCkRlY2xhcm8gZXN0YXIgY2llbnRlIGRlIHF1ZSBlc3RlIFRlcm1vIGRlIERlcMOzc2l0byBMZWdhbCBlIEF1dG9yaXphw6fDo28gdGVtIG8gb2JqZXRpdm8gZGUgZGl2dWxnYcOnw6NvIGRvcyBkb2N1bWVudG9zIGRlcG9zaXRhZG9zIG5vIFJlcG9zaXTDs3JpbyBEaWdpdGFsIGRhIFVGUEUgZSBkZWNsYXJvIHF1ZToKCkkgLSBvcyBkYWRvcyBwcmVlbmNoaWRvcyBubyBmb3JtdWzDoXJpbyBkZSBkZXDDs3NpdG8gc8OjbyB2ZXJkYWRlaXJvcyBlIGF1dMOqbnRpY29zOwoKSUkgLSAgbyBjb250ZcO6ZG8gZGlzcG9uaWJpbGl6YWRvIMOpIGRlIHJlc3BvbnNhYmlsaWRhZGUgZGUgc3VhIGF1dG9yaWE7CgpJSUkgLSBvIGNvbnRlw7pkbyDDqSBvcmlnaW5hbCwgZSBzZSBvIHRyYWJhbGhvIGUvb3UgcGFsYXZyYXMgZGUgb3V0cmFzIHBlc3NvYXMgZm9yYW0gdXRpbGl6YWRvcywgZXN0YXMgZm9yYW0gZGV2aWRhbWVudGUgcmVjb25oZWNpZGFzOwoKSVYgLSBxdWFuZG8gdHJhdGFyLXNlIGRlIG9icmEgY29sZXRpdmEgKG1haXMgZGUgdW0gYXV0b3IpOiB0b2RvcyBvcyBhdXRvcmVzIGVzdMOjbyBjaWVudGVzIGRvIGRlcMOzc2l0byBlIGRlIGFjb3JkbyBjb20gZXN0ZSB0ZXJtbzsKClYgLSBxdWFuZG8gdHJhdGFyLXNlIGRlIFRyYWJhbGhvIGRlIENvbmNsdXPDo28gZGUgQ3Vyc28sIERpc3NlcnRhw6fDo28gb3UgVGVzZTogbyBhcnF1aXZvIGRlcG9zaXRhZG8gY29ycmVzcG9uZGUgw6AgdmVyc8OjbyBmaW5hbCBkbyB0cmFiYWxobzsKClZJIC0gcXVhbmRvIHRyYXRhci1zZSBkZSBUcmFiYWxobyBkZSBDb25jbHVzw6NvIGRlIEN1cnNvLCBEaXNzZXJ0YcOnw6NvIG91IFRlc2U6IGVzdG91IGNpZW50ZSBkZSBxdWUgYSBhbHRlcmHDp8OjbyBkYSBtb2RhbGlkYWRlIGRlIGFjZXNzbyBhbyBkb2N1bWVudG8gYXDDs3MgbyBkZXDDs3NpdG8gZSBhbnRlcyBkZSBmaW5kYXIgbyBwZXLDrW9kbyBkZSBlbWJhcmdvLCBxdWFuZG8gZm9yIGVzY29saGlkbyBhY2Vzc28gcmVzdHJpdG8sIHNlcsOhIHBlcm1pdGlkYSBtZWRpYW50ZSBzb2xpY2l0YcOnw6NvIGRvIChhKSBhdXRvciAoYSkgYW8gU2lzdGVtYSBJbnRlZ3JhZG8gZGUgQmlibGlvdGVjYXMgZGEgVUZQRSAoU0lCL1VGUEUpLgoKIApQYXJhIHRyYWJhbGhvcyBlbSBBY2Vzc28gQWJlcnRvOgoKTmEgcXVhbGlkYWRlIGRlIHRpdHVsYXIgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRlIGF1dG9yIHF1ZSByZWNhZW0gc29icmUgZXN0ZSBkb2N1bWVudG8sIGZ1bmRhbWVudGFkbyBuYSBMZWkgZGUgRGlyZWl0byBBdXRvcmFsIG5vIDkuNjEwLCBkZSAxOSBkZSBmZXZlcmVpcm8gZGUgMTk5OCwgYXJ0LiAyOSwgaW5jaXNvIElJSSwgYXV0b3Jpem8gYSBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkZSBQZXJuYW1idWNvIGEgZGlzcG9uaWJpbGl6YXIgZ3JhdHVpdGFtZW50ZSwgc2VtIHJlc3NhcmNpbWVudG8gZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCBwYXJhIGZpbnMgZGUgbGVpdHVyYSwgaW1wcmVzc8OjbyBlL291IGRvd25sb2FkIChhcXVpc2nDp8OjbykgYXRyYXbDqXMgZG8gc2l0ZSBkbyBSZXBvc2l0w7NyaW8gRGlnaXRhbCBkYSBVRlBFIG5vIGVuZGVyZcOnbyBodHRwOi8vd3d3LnJlcG9zaXRvcmlvLnVmcGUuYnIsIGEgcGFydGlyIGRhIGRhdGEgZGUgZGVww7NzaXRvLgoKIApQYXJhIHRyYWJhbGhvcyBlbSBBY2Vzc28gUmVzdHJpdG86CgpOYSBxdWFsaWRhZGUgZGUgdGl0dWxhciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgZGUgYXV0b3IgcXVlIHJlY2FlbSBzb2JyZSBlc3RlIGRvY3VtZW50bywgZnVuZGFtZW50YWRvIG5hIExlaSBkZSBEaXJlaXRvIEF1dG9yYWwgbm8gOS42MTAgZGUgMTkgZGUgZmV2ZXJlaXJvIGRlIDE5OTgsIGFydC4gMjksIGluY2lzbyBJSUksIGF1dG9yaXpvIGEgVW5pdmVyc2lkYWRlIEZlZGVyYWwgZGUgUGVybmFtYnVjbyBhIGRpc3BvbmliaWxpemFyIGdyYXR1aXRhbWVudGUsIHNlbSByZXNzYXJjaW1lbnRvIGRvcyBkaXJlaXRvcyBhdXRvcmFpcywgcGFyYSBmaW5zIGRlIGxlaXR1cmEsIGltcHJlc3PDo28gZS9vdSBkb3dubG9hZCAoYXF1aXNpw6fDo28pIGF0cmF2w6lzIGRvIHNpdGUgZG8gUmVwb3NpdMOzcmlvIERpZ2l0YWwgZGEgVUZQRSBubyBlbmRlcmXDp28gaHR0cDovL3d3dy5yZXBvc2l0b3Jpby51ZnBlLmJyLCBxdWFuZG8gZmluZGFyIG8gcGVyw61vZG8gZGUgZW1iYXJnbyBjb25kaXplbnRlIGFvIHRpcG8gZGUgZG9jdW1lbnRvLCBjb25mb3JtZSBpbmRpY2FkbyBubyBjYW1wbyBEYXRhIGRlIEVtYmFyZ28uCg==Repositório InstitucionalPUBhttps://repositorio.ufpe.br/oai/requestattena@ufpe.bropendoar:22212023-01-04T05:22:51Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)false |
| dc.title.pt_BR.fl_str_mv |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| title |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| spellingShingle |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry MACÊDO, July Bias Engenharia de produção Análise de riscos Relatório de acidentes Processamento de linguagem natural Mineração de texto Refinaria de petróleo Companhia hidroelétrica |
| title_short |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| title_full |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| title_fullStr |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| title_full_unstemmed |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| title_sort |
Development of natural language processing-based solutions for risk analysis : application to a hydropower company and an O&G industry |
| author |
MACÊDO, July Bias |
| author_facet |
MACÊDO, July Bias |
| author_role |
author |
| dc.contributor.authorLattes.pt_BR.fl_str_mv |
http://lattes.cnpq.br/2540702750653143 |
| dc.contributor.advisorLattes.pt_BR.fl_str_mv |
http://lattes.cnpq.br/7778828466828647 |
| dc.contributor.author.fl_str_mv |
MACÊDO, July Bias |
| dc.contributor.advisor1.fl_str_mv |
MOURA, Márcio José das Chagas |
| dc.contributor.advisor-co1.fl_str_mv |
ZIO, Enrico |
| contributor_str_mv |
MOURA, Márcio José das Chagas ZIO, Enrico |
| dc.subject.por.fl_str_mv |
Engenharia de produção Análise de riscos Relatório de acidentes Processamento de linguagem natural Mineração de texto Refinaria de petróleo Companhia hidroelétrica |
| topic |
Engenharia de produção Análise de riscos Relatório de acidentes Processamento de linguagem natural Mineração de texto Refinaria de petróleo Companhia hidroelétrica |
| description |
Risk Analysis (RA) is crucial to prevent and mitigate potential risk events; however, there are several challenges related to RA. For instance, accident investigation reports are useful sources of information to support safety professionals to propose measures to prevent or mitigate identified occupational accident root causes. Nevertheless, reports’ low quality and lack of detail may limit their usefulness. Moreover, the quality of Quantitative Risk Analysis (QRA) strongly relies on the identification of all potential hazards with major consequences related to the operation of an industrial system, which is usually performed by multiple experts and consumes a considerable amount of time and effort. Since valuable knowledge about an industrial system is stored in the form of textual data, Natural Language Processing (NLP) techniques can be helpful since it can be applied to extract, organize, and classify information from text. Although several studies contributed to the advance of RA, most studies applying NLP focus primarily on automatically identifying patterns from reactive data, such as accident reports, and do not consider the quality of information contained in these documents. In addition, different forms of text data store relevant knowledge about industrial systems and their respective risks, especially proactive data such as documents resulting from preliminary risk studies, and adoption of these data could support preventive risk studies. The main purpose of this study is to develop NLP-based solutions to different issues faced in RA. Thus, this thesis presents two methodologies to (i) identify issues in a hydropower company’s accident investigation reports that may compromise their usefulness as a decision support tool (ii) automatically identify risk features from documents to support the initial stage of QRA in Oil and Gas (O&G) industries. Occupational safety technicians can benefit from the methodology that helps to identify issues and propose improvements to the accident reports. In addition, the second methodology can help experts to identify and assess hypothetical accidental scenarios related to the operation of an industrial facility. Thus, this thesis may contribute to the prevention and mitigation of occupational and/or major accidents and consequently avoid/reduce property damage, economic and social disruption, environmental degradation, and human losses. |
| publishDate |
2022 |
| dc.date.issued.fl_str_mv |
2022-12-20 |
| dc.date.accessioned.fl_str_mv |
2023-01-03T13:19:49Z |
| dc.date.available.fl_str_mv |
2023-01-03T13:19:49Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
| format |
doctoralThesis |
| status_str |
publishedVersion |
| dc.identifier.citation.fl_str_mv |
MACÊDO, July Bias. Development of natural language processing-based solutions for risk analysis: application to a hydropower company and an O&G industry. 2022. Tese (Doutorado em Engenharia de Produção) – Universidade Federal de Pernambuco, Recife, 2022. |
| dc.identifier.uri.fl_str_mv |
https://repositorio.ufpe.br/handle/123456789/48492 |
| identifier_str_mv |
MACÊDO, July Bias. Development of natural language processing-based solutions for risk analysis: application to a hydropower company and an O&G industry. 2022. Tese (Doutorado em Engenharia de Produção) – Universidade Federal de Pernambuco, Recife, 2022. |
| url |
https://repositorio.ufpe.br/handle/123456789/48492 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.rights.driver.fl_str_mv |
http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/openAccess |
| rights_invalid_str_mv |
http://creativecommons.org/licenses/by-nc-nd/3.0/br/ |
| eu_rights_str_mv |
openAccess |
| dc.publisher.none.fl_str_mv |
Universidade Federal de Pernambuco |
| dc.publisher.program.fl_str_mv |
Programa de Pos Graduacao em Engenharia de Producao |
| dc.publisher.initials.fl_str_mv |
UFPE |
| dc.publisher.country.fl_str_mv |
Brasil |
| publisher.none.fl_str_mv |
Universidade Federal de Pernambuco |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFPE instname:Universidade Federal de Pernambuco (UFPE) instacron:UFPE |
| instname_str |
Universidade Federal de Pernambuco (UFPE) |
| instacron_str |
UFPE |
| institution |
UFPE |
| reponame_str |
Repositório Institucional da UFPE |
| collection |
Repositório Institucional da UFPE |
| bitstream.url.fl_str_mv |
https://repositorio.ufpe.br/bitstream/123456789/48492/2/license_rdf https://repositorio.ufpe.br/bitstream/123456789/48492/1/TESE%20July%20Bias%20Mac%c3%aado.pdf https://repositorio.ufpe.br/bitstream/123456789/48492/3/license.txt https://repositorio.ufpe.br/bitstream/123456789/48492/4/TESE%20July%20Bias%20Mac%c3%aado.pdf.txt https://repositorio.ufpe.br/bitstream/123456789/48492/5/TESE%20July%20Bias%20Mac%c3%aado.pdf.jpg |
| bitstream.checksum.fl_str_mv |
e39d27027a6cc9cb039ad269a5db8e34 9f6996ad5784a31c1f8093a9a61ce539 5e89a1613ddc8510c6576f4b23a78973 499a39497f5b6f0cdda0e0f4055a5a1d b9070f8e065cb7aec44fb42c632d02be |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 |
| repository.name.fl_str_mv |
Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE) |
| repository.mail.fl_str_mv |
attena@ufpe.br |
| _version_ |
1862741983306448896 |