Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates
| Ano de defesa: | 2025 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | |
| Tipo de documento: | Tese |
| Tipo de acesso: | Acesso aberto |
| Idioma: | eng |
| Instituição de defesa: |
Biblioteca Digitais de Teses e Dissertacoes da USP
Universidade de São Paulo Faculdade de Medicina de Ribeirão Preto |
| Programa de Pós-Graduação: |
Não Informado pela instituição
|
| Departamento: |
Não Informado pela instituição
|
| País: |
Não Informado pela instituição
|
| Palavras-chave em Português: | |
| Link de acesso: | https://www.teses.usp.br/teses/disponiveis/17/17139/tde-17032026-141825/ |
Resumo: | The application of statistical models and methods covers studies an research in general. In this study, these tools are used in health data analysis, whereas statistical currently performence an important role in promoting the effectiveness of study results such as clinical trials, meta-analysis to obtain more accurate estimates of treatment effects, epi demiology in identifying risk factors and diseases, among others. The study introduces the idea of semiparametric regression models applied to right-censored survival data. Semiparametric models are nonlinear models that involve parametric and nonparametric components. There are situations where semiparametric models have advantages over parametric models, as they can capture complex relationships in the data without spec ifying a parametric probability distribution, providing a balance between flexibility and interpretability, that is, semiparametric models have some advantages in terms of adjust ment. The Cox model, or Cox proportional hazards model (Cox, 1972), is an example of a semiparametric model. This model is presented in the context of survival analysis, which is an essential tool in investigating phenomena where the time to the occurrence of an event of interest is the central variable. That is, it intrinsically addresses the temporal nature of the data and the possibility of incorporating censoring (Klein and Moeschberger, 2006). The principles of survival analysis are the fundamental basis of this study, where we aim to apply semiparametric transformation models to relax assumptions about the functional form of the baseline hazard, there by obtaining more robust and accurate in ference in health-related research. Semiparametric transformation models represent a f lexible class of survival analysis models that allow for modeling the effects of covariates on event times through an unspecified transformation function of the baseline hazard, while maintaining the nonparametric nature of the latter. The model fitting in the ap plications presented in this study was carried out using the Bayesian approach, since in semiparametric transformation models, Bayesian inference naturally incorporates uncer tainty in terms of the transformation function and the nonparametric baseline hazard. By specifying flexible smoothing priors, such as the Dirichlet process or splines (Ghosal and Van der Vaart, 2017; Wahba, 1990), it is possible to model the complexity of the transfor mation function without imposing rigid parametric forms. The ability to perform exact inference through computational methods such as MCMC (Markov Chain Monte Carlo) overcomes the limitations of asymptotic approximations, making the Bayesian approach a sophisticated and increasingly relevant tool in survival analysis with semiparametric transformation models. In this research, we highlight the use of survival analysis theory widely explored in medical research, where there is a high frequency of studies on patient monitoring until the occurrence of the event of interest. Therefore, the work consists of an introduction covering the literature review in Chapter 1. In Chapter 2, we present a hierarchical Bayesian analysis considering semiparametric transformation models for a dataset composed of survival times of cancer patients admitted to the intensive care unit of the National Cancer Institute (INCA) in Rio de Janeiro, Brazil. In Chapter 3, we show how to obtain inference for the parameters of semiparametric transformation models in the presence of censoring, covariates, and cure fraction under a hierarchical Bayesian approach, assuming unknown hazard rates as latent variables with a specified probability distribution. The subsequent posterior summaries of interest are obtained us ing existing MCMC simulation methods. Two applications considering survival times in medical studies illustrate the proposed methodology. In Chapter 4, we consider the use of semiparametric (transformation) models for situations in which two or more responses are associated with the same individual or unit. We assume a hierarchical Bayesian analysis for semiparametric transformation models, considering the full likelihood function derived from transformation models, treating the unknown hazard functions as latent variables, and applying MCMC methods to obtain the posterior summaries of interest. The depen dence between multivariate responses for the same individual is captured by introducing another latent variable, or frailty. Illustrations of the proposed methodology are presented by considering two multivariate medical datasets on survival times. |
| id |
USP_d70c41c99c7561a8a4b06a59601ee0c0 |
|---|---|
| oai_identifier_str |
oai:teses.usp.br:tde-17032026-141825 |
| network_acronym_str |
USP |
| network_name_str |
Biblioteca Digital de Teses e Dissertações da USP |
| repository_id_str |
|
| spelling |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariatesModelos de transformação semiparamétrico na análise de dados de tempo de vida na presença de dados censurados e covariáveisAbordagem bayesianaSaúde públicaModelos de transformação semiparamétricoFração de curaDados médicos de tempo de vidaDados censuradosCure fractionLifetime medical dataCensored dataPublic healthBayesian approachSemiparametric transformation modelsThe application of statistical models and methods covers studies an research in general. In this study, these tools are used in health data analysis, whereas statistical currently performence an important role in promoting the effectiveness of study results such as clinical trials, meta-analysis to obtain more accurate estimates of treatment effects, epi demiology in identifying risk factors and diseases, among others. The study introduces the idea of semiparametric regression models applied to right-censored survival data. Semiparametric models are nonlinear models that involve parametric and nonparametric components. There are situations where semiparametric models have advantages over parametric models, as they can capture complex relationships in the data without spec ifying a parametric probability distribution, providing a balance between flexibility and interpretability, that is, semiparametric models have some advantages in terms of adjust ment. The Cox model, or Cox proportional hazards model (Cox, 1972), is an example of a semiparametric model. This model is presented in the context of survival analysis, which is an essential tool in investigating phenomena where the time to the occurrence of an event of interest is the central variable. That is, it intrinsically addresses the temporal nature of the data and the possibility of incorporating censoring (Klein and Moeschberger, 2006). The principles of survival analysis are the fundamental basis of this study, where we aim to apply semiparametric transformation models to relax assumptions about the functional form of the baseline hazard, there by obtaining more robust and accurate in ference in health-related research. Semiparametric transformation models represent a f lexible class of survival analysis models that allow for modeling the effects of covariates on event times through an unspecified transformation function of the baseline hazard, while maintaining the nonparametric nature of the latter. The model fitting in the ap plications presented in this study was carried out using the Bayesian approach, since in semiparametric transformation models, Bayesian inference naturally incorporates uncer tainty in terms of the transformation function and the nonparametric baseline hazard. By specifying flexible smoothing priors, such as the Dirichlet process or splines (Ghosal and Van der Vaart, 2017; Wahba, 1990), it is possible to model the complexity of the transfor mation function without imposing rigid parametric forms. The ability to perform exact inference through computational methods such as MCMC (Markov Chain Monte Carlo) overcomes the limitations of asymptotic approximations, making the Bayesian approach a sophisticated and increasingly relevant tool in survival analysis with semiparametric transformation models. In this research, we highlight the use of survival analysis theory widely explored in medical research, where there is a high frequency of studies on patient monitoring until the occurrence of the event of interest. Therefore, the work consists of an introduction covering the literature review in Chapter 1. In Chapter 2, we present a hierarchical Bayesian analysis considering semiparametric transformation models for a dataset composed of survival times of cancer patients admitted to the intensive care unit of the National Cancer Institute (INCA) in Rio de Janeiro, Brazil. In Chapter 3, we show how to obtain inference for the parameters of semiparametric transformation models in the presence of censoring, covariates, and cure fraction under a hierarchical Bayesian approach, assuming unknown hazard rates as latent variables with a specified probability distribution. The subsequent posterior summaries of interest are obtained us ing existing MCMC simulation methods. Two applications considering survival times in medical studies illustrate the proposed methodology. In Chapter 4, we consider the use of semiparametric (transformation) models for situations in which two or more responses are associated with the same individual or unit. We assume a hierarchical Bayesian analysis for semiparametric transformation models, considering the full likelihood function derived from transformation models, treating the unknown hazard functions as latent variables, and applying MCMC methods to obtain the posterior summaries of interest. The depen dence between multivariate responses for the same individual is captured by introducing another latent variable, or frailty. Illustrations of the proposed methodology are presented by considering two multivariate medical datasets on survival times.A aplicação de modelos e métodos estatísticos abrange estudos e pesquisas em geral. Nesse estudo, essas ferramentas são usadas nas análises de dados da saúde, visto que a estatística atualmente desempenha um importante papel na promoção da eficácia dos resultados dos estudos como ensaios clínicos, metanálise para obter estimativas mais precisas dos efeitos de tratamentos, epidemiologia na identificação de fatores de riscos e doenças, entre outros. O estudo introduz a ideia de modelos de regressão semiparamétricos aplicados a dados de sobrevida censurados à direita. Os modelos semiparamétricos, são modelos não lineares que envolve componentes paramétricos e não paramétricos. Existem situações que os modelos semiparamétricos possuem vantagens sobre os modelos paramétricos, pois podem capturar relações complexas nos dados sem especificar uma distribuição de probabilidade paramétrica, proporcionando um equilíbrio entre flexibilidade e interpretabilidade, ou seja, os modelos semiparamétricos apresentam algumas vantagens em termos de ajuste. O modelo de Cox ou modelo de riscos proporcionais de Cox (Cox, 1972) é um exemplo de modelo semiparamétrico. Este modelo é apresentado no contexto da análise de sobrevivência, que é uma ferramenta essencial na investigação de fenômenos onde o tempo até a ocorrência de um evento de interesse é a variável central, ou seja, lida intrinsecamente com a natureza temporal dos dados e ainda com a possibilidade de incorporar censura (Klein and Moeschberger, 2006). Os princípios da análise de sobrevivência são as bases fundamental do estudo, onde objetivamos aplicar modelos de transformação semiparamétrico para flexibilizar as suposições sobre a forma funcional do risco basal e, assim, obter inferência mais robusta e precisa em pesquisas relacionadas à saúde. Os modelos de transformação semiparamétrico representam uma classe flexível de modelos de análise de sobrevivência que permitem modelar os efeitos de covariáveis no tempo do evento por meio de uma função de transformação não especificada para o risco basal, mantendo a natureza não paramétrica deste último. Os ajustes dos modelos apresentados nas aplicações do estudo foram realizados utilizando a abordagem bayesiana, uma vez que, em modelos de transformação semiparamétrico, a inferência bayesiana permite a incorporação natural da incerteza em termos da função de transformação e do risco basal não paramétrico. Especificando priores de suavização flexíveis, como o processo de Dirichlet ou splines (Ghosal and Van der Vaart, 2017; Wahba, 1990), é possível modelar a complexidade da função de transformação impondo formas paramétricas rígidas. A capacidade de realizar inferências exatas por meio de métodos computacionais como MCMC (Markov Chain Monte Carlo) contorna as limitações das aproximações assintóticas, tornando a abordagem bayesiana uma ferramenta sofisticada e cada vez mais relevante em análises de sobrevida com modelos de transformação semiparamétrico. Na pesquisa destacamos o uso da teoria de análise de sobrevivência, amplamente explorada nas pesquisas médicas, ou seja, onde há grande frequência de estudos sobre monitoramento de pacientes até a ocorrência do evento de interesse. Portanto, o trabalho consiste em uma introdução que abrange a revisão de literatura no capítulo 1. No capítulo 2, apresenta-se uma análise bayesiana hierárquica considerando modelos de transformação semiparamétrico para um conjunto de dados composto pelos tempos de sobrevida de pacientes com câncer internados na unidade de terapia intensiva do Instituto Nacional de Câncer (INCA) do Rio de Janeiro, Brasil. No capítulo 3, mostramos como obter inferência para os parâmetros de modelos de transformação semiparamétrico na presença de censura, covariáveis e fração de cura sob uma abordagem bayesiana hierárquica, assumindo as taxas de risco desconhecidas como variáveis latentes, assumindo uma distribuição de probabilidade especificada. Os resumos posteriores de interesse são obtidos utilizando métodos de simulação MCMC existentes. Duas aplicações considerando o tempo de vida em estudos médicos ilustram a metodologia proposta. No capítulo 4, consideramos o uso de modelos de transformação semiparamétrico para situações em que há duas ou mais respostas associadas ao mesmo indivíduo ou unidade. Assumimos uma análise bayesiana hierárquica para os modelos de transformação semiparamétrico, considerando a função de verossimilhança completa obtida a partir dos modelos de transformação, considerando as funções de risco desconhecidas como variáveis latentes desconhecidas e métodos de MCMC para obter os resumos posteriores de interesse. A dependência entre as respostas multivariadas para o mesmo indivíduo é capturada pela introdução de outra variável latente ou fragilidade. Ilustrações de metodologia proposta são apresentadas considerando dois conjuntos de dados médicos multivariados sobre o tempo de vida.Biblioteca Digitais de Teses e Dissertacoes da USPUniversidade de São PauloFaculdade de Medicina de Ribeirão PretoAchcar, Jorge AlbertoBarili, Emerson2025-11-132026-04-16info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/17/17139/tde-17032026-141825/doi:10.11606/T.17.2025.tde-17032026-141825Liberar o conteúdo para acesso público.info:eu-repo/semantics/openAccessengreponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USP2026-04-16T12:17:02Zoai:teses.usp.br:tde-17032026-141825Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.bropendoar:27212026-04-16T12:17:02Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false |
| dc.title.none.fl_str_mv |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates Modelos de transformação semiparamétrico na análise de dados de tempo de vida na presença de dados censurados e covariáveis |
| title |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates |
| spellingShingle |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates Barili, Emerson Abordagem bayesiana Saúde pública Modelos de transformação semiparamétrico Fração de cura Dados médicos de tempo de vida Dados censurados Cure fraction Lifetime medical data Censored data Public health Bayesian approach Semiparametric transformation models |
| title_short |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates |
| title_full |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates |
| title_fullStr |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates |
| title_full_unstemmed |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates |
| title_sort |
Semiparametric transformation models in the analysis of lifetime data in presence of censored data and covariates |
| author |
Barili, Emerson |
| author_facet |
Barili, Emerson |
| author_role |
author |
| dc.contributor.none.fl_str_mv |
Achcar, Jorge Alberto |
| dc.contributor.author.fl_str_mv |
Barili, Emerson |
| dc.subject.por.fl_str_mv |
Abordagem bayesiana Saúde pública Modelos de transformação semiparamétrico Fração de cura Dados médicos de tempo de vida Dados censurados Cure fraction Lifetime medical data Censored data Public health Bayesian approach Semiparametric transformation models |
| topic |
Abordagem bayesiana Saúde pública Modelos de transformação semiparamétrico Fração de cura Dados médicos de tempo de vida Dados censurados Cure fraction Lifetime medical data Censored data Public health Bayesian approach Semiparametric transformation models |
| description |
The application of statistical models and methods covers studies an research in general. In this study, these tools are used in health data analysis, whereas statistical currently performence an important role in promoting the effectiveness of study results such as clinical trials, meta-analysis to obtain more accurate estimates of treatment effects, epi demiology in identifying risk factors and diseases, among others. The study introduces the idea of semiparametric regression models applied to right-censored survival data. Semiparametric models are nonlinear models that involve parametric and nonparametric components. There are situations where semiparametric models have advantages over parametric models, as they can capture complex relationships in the data without spec ifying a parametric probability distribution, providing a balance between flexibility and interpretability, that is, semiparametric models have some advantages in terms of adjust ment. The Cox model, or Cox proportional hazards model (Cox, 1972), is an example of a semiparametric model. This model is presented in the context of survival analysis, which is an essential tool in investigating phenomena where the time to the occurrence of an event of interest is the central variable. That is, it intrinsically addresses the temporal nature of the data and the possibility of incorporating censoring (Klein and Moeschberger, 2006). The principles of survival analysis are the fundamental basis of this study, where we aim to apply semiparametric transformation models to relax assumptions about the functional form of the baseline hazard, there by obtaining more robust and accurate in ference in health-related research. Semiparametric transformation models represent a f lexible class of survival analysis models that allow for modeling the effects of covariates on event times through an unspecified transformation function of the baseline hazard, while maintaining the nonparametric nature of the latter. The model fitting in the ap plications presented in this study was carried out using the Bayesian approach, since in semiparametric transformation models, Bayesian inference naturally incorporates uncer tainty in terms of the transformation function and the nonparametric baseline hazard. By specifying flexible smoothing priors, such as the Dirichlet process or splines (Ghosal and Van der Vaart, 2017; Wahba, 1990), it is possible to model the complexity of the transfor mation function without imposing rigid parametric forms. The ability to perform exact inference through computational methods such as MCMC (Markov Chain Monte Carlo) overcomes the limitations of asymptotic approximations, making the Bayesian approach a sophisticated and increasingly relevant tool in survival analysis with semiparametric transformation models. In this research, we highlight the use of survival analysis theory widely explored in medical research, where there is a high frequency of studies on patient monitoring until the occurrence of the event of interest. Therefore, the work consists of an introduction covering the literature review in Chapter 1. In Chapter 2, we present a hierarchical Bayesian analysis considering semiparametric transformation models for a dataset composed of survival times of cancer patients admitted to the intensive care unit of the National Cancer Institute (INCA) in Rio de Janeiro, Brazil. In Chapter 3, we show how to obtain inference for the parameters of semiparametric transformation models in the presence of censoring, covariates, and cure fraction under a hierarchical Bayesian approach, assuming unknown hazard rates as latent variables with a specified probability distribution. The subsequent posterior summaries of interest are obtained us ing existing MCMC simulation methods. Two applications considering survival times in medical studies illustrate the proposed methodology. In Chapter 4, we consider the use of semiparametric (transformation) models for situations in which two or more responses are associated with the same individual or unit. We assume a hierarchical Bayesian analysis for semiparametric transformation models, considering the full likelihood function derived from transformation models, treating the unknown hazard functions as latent variables, and applying MCMC methods to obtain the posterior summaries of interest. The depen dence between multivariate responses for the same individual is captured by introducing another latent variable, or frailty. Illustrations of the proposed methodology are presented by considering two multivariate medical datasets on survival times. |
| publishDate |
2025 |
| dc.date.none.fl_str_mv |
2025-11-13 2026-04-16 |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
| format |
doctoralThesis |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://www.teses.usp.br/teses/disponiveis/17/17139/tde-17032026-141825/ doi:10.11606/T.17.2025.tde-17032026-141825 |
| url |
https://www.teses.usp.br/teses/disponiveis/17/17139/tde-17032026-141825/ |
| identifier_str_mv |
doi:10.11606/T.17.2025.tde-17032026-141825 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.rights.driver.fl_str_mv |
Liberar o conteúdo para acesso público. info:eu-repo/semantics/openAccess |
| rights_invalid_str_mv |
Liberar o conteúdo para acesso público. |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
Biblioteca Digitais de Teses e Dissertacoes da USP Universidade de São Paulo Faculdade de Medicina de Ribeirão Preto |
| publisher.none.fl_str_mv |
Biblioteca Digitais de Teses e Dissertacoes da USP Universidade de São Paulo Faculdade de Medicina de Ribeirão Preto |
| dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da USP instname:Universidade de São Paulo (USP) instacron:USP |
| instname_str |
Universidade de São Paulo (USP) |
| instacron_str |
USP |
| institution |
USP |
| reponame_str |
Biblioteca Digital de Teses e Dissertações da USP |
| collection |
Biblioteca Digital de Teses e Dissertações da USP |
| repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP) |
| repository.mail.fl_str_mv |
virginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.br |
| _version_ |
1865492704353320960 |