Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin

Detalhes bibliográficos
Ano de defesa: 2024
Autor(a) principal: Araújo, Fellippe Roberto Alves Bione de
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
dARK ID: ark:/87559/00130000094pk
Idioma: eng
Instituição de defesa: Não Informado pela instituição
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
TOC
COT
Link de acesso: https://app.uff.br/riuff/handle/1/33962
Resumo: Understanding the spatial distribution of potential source rocks in a sedimentary basin is fundamental for paleoenvironmental interpretations, petroleum system modeling and exploration risk assessment. One way to efficiently assess source rocks' potential and distribution is through total organic carbon (TOC) and organic facies modeling. In this thesis, Sections 1 – 5 are focused in presenting process-based algorithms that were developed from a conceptual model of marine organic carbon deposition, including empirical equations for primary productivity, carbon flux and burial efficiency that allow to estimate the autochthonous and allochthonous fractions that compose TOC. Based on the calculated component fractions of organic carbon, a classification algorithm using fuzzy logics and reference values of hydrogen index and oxygen index is applied to obtain the correspondent organic facies classes. These algorithms are presented as an alternative for estimating TOC and organic facies classifications in marine environments of the Brazilian marginal basins. To test the applicability of the proposed methods, two simulation approaches are presented for the Espírito Santo Basin (ESB) offshore region: the first, a series of 2D simulations utilizing modern surficial sediments data and the second, 1D simulations of deep geological time using available data of 24 wells. The simulations show good adherence with a similar evaluation conducted for the whole South Atlantic Ocean but provide new approaches, specifically targeting the ESB. Different statistical methods and the Dynamic Time Warping technique are used to evaluate simulations' performance and obtain the optimal configurations for the ESB. The results indicate the predominance of transgressive regimens through the analyzed stratigraphic intervals and organic facies classifications of mixed continental and marine contributions, both in accordance to previously known characteristics of ESB stratigraphic record. The occurrence of TOC anomalies interpreted as related to Oceanic Anoxic Events, previously reported for some of the analyzed wells is successfully captured by the conducted process-based simulations. Ultimately, the findings presented in this work provide insights about the practical application of the proposed methods in the Brazilian continental margin. Alternatively, in Section 6, a different simulation approach, focusing on creating a generalized model for TOC prediction is presented. In this case, the XGBoost machine learning algorithm was applied to a compiled comprehensive data set containing well log and geochemical data from the ESB to run multiple solutions of parameter tuning and effectively predicting TOC for unconstrained stratigraphic intervals. This approach is then compared the traditional ΔlogR method, outperforming the latter. XGBoost effectively predicted TOC, yielding a coefficient of determination R2 of 0.71, RMSE of 0.55 and MAE of 0.30, based on the average of all 10-fold cross-validation test sets for a large dataset, containing 6353 observed TOC entries, thus, indicate the potential of machine learning for TOC prediction in large, heterogeneous data sets, configuring a promising tool for the usage of available public data sets in similar applications, such as the oil and gas (O&G) industry's exploration phase or field reassessment.
id UFF-2_6f4430a0dfc691a944bf84e0b6aacc25
oai_identifier_str oai:app.uff.br:1/33962
network_acronym_str UFF-2
network_name_str Repositório Institucional da Universidade Federal Fluminense (RIUFF)
repository_id_str
spelling Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basinTOCSource rocksMarine environmentGeological modelingGeological dataGeochemistryWell logsXGBoostCarbono orgânicoFácies sedimentaresModelagem geológicaRocha geradoraModelagem computacionalGeoquímicaCOTRochas geradorasAmbiente marinhoModelagem geológicaDados geológicosGeoquímicaPerfis geofísicos de poçosXGBoostUnderstanding the spatial distribution of potential source rocks in a sedimentary basin is fundamental for paleoenvironmental interpretations, petroleum system modeling and exploration risk assessment. One way to efficiently assess source rocks' potential and distribution is through total organic carbon (TOC) and organic facies modeling. In this thesis, Sections 1 – 5 are focused in presenting process-based algorithms that were developed from a conceptual model of marine organic carbon deposition, including empirical equations for primary productivity, carbon flux and burial efficiency that allow to estimate the autochthonous and allochthonous fractions that compose TOC. Based on the calculated component fractions of organic carbon, a classification algorithm using fuzzy logics and reference values of hydrogen index and oxygen index is applied to obtain the correspondent organic facies classes. These algorithms are presented as an alternative for estimating TOC and organic facies classifications in marine environments of the Brazilian marginal basins. To test the applicability of the proposed methods, two simulation approaches are presented for the Espírito Santo Basin (ESB) offshore region: the first, a series of 2D simulations utilizing modern surficial sediments data and the second, 1D simulations of deep geological time using available data of 24 wells. The simulations show good adherence with a similar evaluation conducted for the whole South Atlantic Ocean but provide new approaches, specifically targeting the ESB. Different statistical methods and the Dynamic Time Warping technique are used to evaluate simulations' performance and obtain the optimal configurations for the ESB. The results indicate the predominance of transgressive regimens through the analyzed stratigraphic intervals and organic facies classifications of mixed continental and marine contributions, both in accordance to previously known characteristics of ESB stratigraphic record. The occurrence of TOC anomalies interpreted as related to Oceanic Anoxic Events, previously reported for some of the analyzed wells is successfully captured by the conducted process-based simulations. Ultimately, the findings presented in this work provide insights about the practical application of the proposed methods in the Brazilian continental margin. Alternatively, in Section 6, a different simulation approach, focusing on creating a generalized model for TOC prediction is presented. In this case, the XGBoost machine learning algorithm was applied to a compiled comprehensive data set containing well log and geochemical data from the ESB to run multiple solutions of parameter tuning and effectively predicting TOC for unconstrained stratigraphic intervals. This approach is then compared the traditional ΔlogR method, outperforming the latter. XGBoost effectively predicted TOC, yielding a coefficient of determination R2 of 0.71, RMSE of 0.55 and MAE of 0.30, based on the average of all 10-fold cross-validation test sets for a large dataset, containing 6353 observed TOC entries, thus, indicate the potential of machine learning for TOC prediction in large, heterogeneous data sets, configuring a promising tool for the usage of available public data sets in similar applications, such as the oil and gas (O&G) industry's exploration phase or field reassessment.Compreender a distribuição espacial de rochas com potencial gerador em uma bacia sedimentar é fundamental para interpretações paleoambientais, modelagem de sistemas petrolíferos e diminuição de riscos exploratórios. Uma forma eficaz de avaliar a distribuição espacial de rochas potencialmente geradoras se dá através da modelagem de carbono orgânico total (COT) e de fácies orgânica. Nesta tese, nos Capítulos 1 – 5 abordam-se algoritmos de simulação baseados em processos, desenvolvidos a partir de um modelo conceitual de deposição de carbono orgânico marinho, incluindo equações empíricas para a produtividade primária, fluxo de carbono e eficiência de soterramento, que permitem estimar as frações autóctones e alóctones que compõem o COT. Com base nestas frações componentes calculadas, um algoritmo de classificação baseado em fuzzy logics e valores de referência de índice de hidrogénio e índice de oxigênio é aplicado para a obtenção das classes de fácies orgânicas correspondentes. Estes algoritmos são apresentados como uma alternativa para estimar o COT e classificações de fácies orgânicas em ambientes marinhos de bacias marginais brasileiras. Para testar a aplicabilidade dos métodos propostos, duas abordagens de simulação foram testadas na região offshore da Bacia do Espírito Santo (BES): a primeira, uma série de simulações 2D utilizando dados de sedimentos superficiais modernos e a segunda, simulações 1D no tempo geológico profundo utilizando dados disponíveis de 24 poços. As simulações realizadas mostram boa aderência quando comparadas a um estudo semelhante, realizado para toda a região do Oceano Atlântico Sul, mas fornecem novas abordagens, visando especificamente a BES. Diferentes métodos estatísticos e o método de Dynamic Time Warping são utilizados para avaliar o desempenho das simulações e obter as melhores configurações para a BES. Os resultados indicam a predominância de regimes transgressivos ao longo dos intervalos estratigráficos analisados e classificações de fácies orgânicas compatíveis com contribuições mistas de matéria orgânica marinha e continental. Ambas as observações estão de acordo com características previamente descritas a partir do registro estratigráfico da BES. A ocorrência de anomalias de COT interpretadas como relacionadas a Eventos Anóxicos Oceânicos, previamente descritas em alguns dos poços analisados, foram identificadas com sucesso pelas simulações baseadas em processos realizadas. Em última análise, os resultados apresentados no presente trabalho fornecem informações acerca da aplicação prática dos métodos propostos na margem continental brasileira. Alternativamente, no Capítulo 6, apresenta-se uma abordagem de simulação de COT com foco obtenção de modelos generalistas. Neste caso, o algoritmo de aprendizagem de máquina XGBoost foi utilizado sobre um conjunto de dados abrangente, composto por dados de perfis geofísicos de poços e geoquímicos da BES. Foram executadas múltiplas rodadas de soluções para a obtenção de parâmetros para a obtenção do modelo preditivo de COT para intervalos sem controle estratigráfico. Esta abordagem foi comparada com o método tradicional de predição de TOC, ΔlogR, superando este último. O XGBoost previu eficazmente o COT, produzindo um coeficiente de determinação R2 de 0.71, RMSE de 0.55 e MAE de 0.30, com base na média entre todos os conjuntos de testes de validação cruzada para um grande conjunto de dados, contendo 6353 entradas de COT observadas. Os resultados indicam, assim, o potencial da aprendizagem de máquina para a predição de COT em grandes conjuntos de dados heterogêneos, configurando uma ferramenta promissora para a utilização de conjuntos de dados públicos disponíveis em aplicações semelhantes, durante fases de exploração da indústria de petróleo e gás, ou em estudos de reavaliação de campos maduros.182 f.Albuquerque, Ana Luiza Spadanohttp://lattes.cnpq.br/4016720596063058Belém, Andre Luizhttp://lattes.cnpq.br/8174173696509765Almeida, Anderson Gomes dehttp://lattes.cnpq.br/3289186828253975Reis, Antonio Tadeu doshttp://lattes.cnpq.br/8212550476765235Silveira, Carla Semiramishttp://lattes.cnpq.br/5607967300278589Oliveira, Igor Martins Venâncio Padilha dehttp://lattes.cnpq.br/8375137961138590http://lattes.cnpq.br/5917855779924288Araújo, Fellippe Roberto Alves Bione de2024-08-07T16:38:58Z2024-08-07T16:38:58Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfARAÚJO, Fellippe Roberto Alves Bione de. Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin. 2024. 182 f. Tese (Doutorado em Geociências - Geoquímica ambiental) - Universidade Federal Fluminense, Niterói, 2024.https://app.uff.br/riuff/handle/1/33962ark:/87559/00130000094pkCC-BY-SAinfo:eu-repo/semantics/openAccessengreponame:Repositório Institucional da Universidade Federal Fluminense (RIUFF)instname:Universidade Federal Fluminense (UFF)instacron:UFF2025-05-07T14:13:09Zoai:app.uff.br:1/33962Repositório InstitucionalPUBhttps://app.uff.br/oai/requestriuff@id.uff.bropendoar:21202025-05-07T14:13:09Repositório Institucional da Universidade Federal Fluminense (RIUFF) - Universidade Federal Fluminense (UFF)false
dc.title.none.fl_str_mv Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
title Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
spellingShingle Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
Araújo, Fellippe Roberto Alves Bione de
TOC
Source rocks
Marine environment
Geological modeling
Geological data
Geochemistry
Well logs
XGBoost
Carbono orgânico
Fácies sedimentares
Modelagem geológica
Rocha geradora
Modelagem computacional
Geoquímica
COT
Rochas geradoras
Ambiente marinho
Modelagem geológica
Dados geológicos
Geoquímica
Perfis geofísicos de poços
XGBoost
title_short Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
title_full Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
title_fullStr Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
title_full_unstemmed Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
title_sort Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin
author Araújo, Fellippe Roberto Alves Bione de
author_facet Araújo, Fellippe Roberto Alves Bione de
author_role author
dc.contributor.none.fl_str_mv Albuquerque, Ana Luiza Spadano
http://lattes.cnpq.br/4016720596063058
Belém, Andre Luiz
http://lattes.cnpq.br/8174173696509765
Almeida, Anderson Gomes de
http://lattes.cnpq.br/3289186828253975
Reis, Antonio Tadeu dos
http://lattes.cnpq.br/8212550476765235
Silveira, Carla Semiramis
http://lattes.cnpq.br/5607967300278589
Oliveira, Igor Martins Venâncio Padilha de
http://lattes.cnpq.br/8375137961138590
http://lattes.cnpq.br/5917855779924288
dc.contributor.author.fl_str_mv Araújo, Fellippe Roberto Alves Bione de
dc.subject.por.fl_str_mv TOC
Source rocks
Marine environment
Geological modeling
Geological data
Geochemistry
Well logs
XGBoost
Carbono orgânico
Fácies sedimentares
Modelagem geológica
Rocha geradora
Modelagem computacional
Geoquímica
COT
Rochas geradoras
Ambiente marinho
Modelagem geológica
Dados geológicos
Geoquímica
Perfis geofísicos de poços
XGBoost
topic TOC
Source rocks
Marine environment
Geological modeling
Geological data
Geochemistry
Well logs
XGBoost
Carbono orgânico
Fácies sedimentares
Modelagem geológica
Rocha geradora
Modelagem computacional
Geoquímica
COT
Rochas geradoras
Ambiente marinho
Modelagem geológica
Dados geológicos
Geoquímica
Perfis geofísicos de poços
XGBoost
description Understanding the spatial distribution of potential source rocks in a sedimentary basin is fundamental for paleoenvironmental interpretations, petroleum system modeling and exploration risk assessment. One way to efficiently assess source rocks' potential and distribution is through total organic carbon (TOC) and organic facies modeling. In this thesis, Sections 1 – 5 are focused in presenting process-based algorithms that were developed from a conceptual model of marine organic carbon deposition, including empirical equations for primary productivity, carbon flux and burial efficiency that allow to estimate the autochthonous and allochthonous fractions that compose TOC. Based on the calculated component fractions of organic carbon, a classification algorithm using fuzzy logics and reference values of hydrogen index and oxygen index is applied to obtain the correspondent organic facies classes. These algorithms are presented as an alternative for estimating TOC and organic facies classifications in marine environments of the Brazilian marginal basins. To test the applicability of the proposed methods, two simulation approaches are presented for the Espírito Santo Basin (ESB) offshore region: the first, a series of 2D simulations utilizing modern surficial sediments data and the second, 1D simulations of deep geological time using available data of 24 wells. The simulations show good adherence with a similar evaluation conducted for the whole South Atlantic Ocean but provide new approaches, specifically targeting the ESB. Different statistical methods and the Dynamic Time Warping technique are used to evaluate simulations' performance and obtain the optimal configurations for the ESB. The results indicate the predominance of transgressive regimens through the analyzed stratigraphic intervals and organic facies classifications of mixed continental and marine contributions, both in accordance to previously known characteristics of ESB stratigraphic record. The occurrence of TOC anomalies interpreted as related to Oceanic Anoxic Events, previously reported for some of the analyzed wells is successfully captured by the conducted process-based simulations. Ultimately, the findings presented in this work provide insights about the practical application of the proposed methods in the Brazilian continental margin. Alternatively, in Section 6, a different simulation approach, focusing on creating a generalized model for TOC prediction is presented. In this case, the XGBoost machine learning algorithm was applied to a compiled comprehensive data set containing well log and geochemical data from the ESB to run multiple solutions of parameter tuning and effectively predicting TOC for unconstrained stratigraphic intervals. This approach is then compared the traditional ΔlogR method, outperforming the latter. XGBoost effectively predicted TOC, yielding a coefficient of determination R2 of 0.71, RMSE of 0.55 and MAE of 0.30, based on the average of all 10-fold cross-validation test sets for a large dataset, containing 6353 observed TOC entries, thus, indicate the potential of machine learning for TOC prediction in large, heterogeneous data sets, configuring a promising tool for the usage of available public data sets in similar applications, such as the oil and gas (O&G) industry's exploration phase or field reassessment.
publishDate 2024
dc.date.none.fl_str_mv 2024-08-07T16:38:58Z
2024-08-07T16:38:58Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv ARAÚJO, Fellippe Roberto Alves Bione de. Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin. 2024. 182 f. Tese (Doutorado em Geociências - Geoquímica ambiental) - Universidade Federal Fluminense, Niterói, 2024.
https://app.uff.br/riuff/handle/1/33962
dc.identifier.dark.fl_str_mv ark:/87559/00130000094pk
identifier_str_mv ARAÚJO, Fellippe Roberto Alves Bione de. Modeling total organic carbon and organic facies: process-based and machine learning algorithms for the Espírito Santo basin. 2024. 182 f. Tese (Doutorado em Geociências - Geoquímica ambiental) - Universidade Federal Fluminense, Niterói, 2024.
ark:/87559/00130000094pk
url https://app.uff.br/riuff/handle/1/33962
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv CC-BY-SA
info:eu-repo/semantics/openAccess
rights_invalid_str_mv CC-BY-SA
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Institucional da Universidade Federal Fluminense (RIUFF)
instname:Universidade Federal Fluminense (UFF)
instacron:UFF
instname_str Universidade Federal Fluminense (UFF)
instacron_str UFF
institution UFF
reponame_str Repositório Institucional da Universidade Federal Fluminense (RIUFF)
collection Repositório Institucional da Universidade Federal Fluminense (RIUFF)
repository.name.fl_str_mv Repositório Institucional da Universidade Federal Fluminense (RIUFF) - Universidade Federal Fluminense (UFF)
repository.mail.fl_str_mv riuff@id.uff.br
_version_ 1848091221032960000