USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS

Detalhes bibliográficos
Ano de defesa: 2026
Autor(a) principal: SANTOS, Heloísa Baima da Silva lattes
Orientador(a): FRANÇA, Ana Karina Teixeira da Cunha lattes
Banca de defesa: FRANÇA, Ana Karina Teixeira da Cunha lattes, SANTOS, Alcione Miranda dos lattes, SOUSA JÚNIOR, Carlos Magno lattes, NASCIMENTO, Joelma Ximenes Prado Teixeira lattes, BOGÉA, Eduarda Gomes lattes
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal do Maranhão
Programa de Pós-Graduação: PROGRAMA DE PÓS-GRADUAÇÃO EM SAÚDE COLETIVA/CCBS
Departamento: COORDENAÇÃO DO CURSO DE NUTRIÇÃO/CCBS
País: Brasil
Palavras-chave em Português:
Palavras-chave em Inglês:
Área do conhecimento CNPq:
Link de acesso: https://tedebc.ufma.br/jspui/handle/tede/6815
Resumo: Considering the rising prevalence of obesity in Brazil and worldwide, the limitations of Body Mass Index (BMI) in differentiating lean mass from fat mass, as well as the absence of a precise and accessible method to assess body fat percentage (%BF), this study aimed to develop models capable of estimating %BF in adults, using demographic data and simple anthropometric measures, through Machine Learning (ML) techniques. All statistical analyses and model construction were performed using the R® programming language. The study population consisted of 7,085 adults aged 22 and 30 years, belonging to the 1993 and 1982 Pelotas-RS birth cohorts, respectively. Participants' BMI values were calculated and classified according to the World Health Organization. Individuals were further categorized based on %BF values into obese (≥ 25% for men and ≥ 32% for women) and non-obese (≤ 25% for men and ≤ 32% for women). After BMI and %BF classification, individuals with BMI < 25 kg/m² were also grouped into normal weight (BMI < 25 kg/m² and absence of obesity by %BF) and normalweight obese (BMI < 25 kg/m² and presence of obesity by %BF). Input variables considered were demographic data (sex and age) and anthropometric measures (weight, height, and waist (WC), hip (HC), right wrist, and right calf circumferences). The outcome was defined as the %BF measured by Dual-Energy X-ray Absorptiometry (DXA). For model construction, the Radial Support Vector Machine (SVM) algorithm was applied, adopting Linear Regression (LR) as a baseline. Data partitioning followed the Hold-Out method, with 80% comprising the training set and 20% the test set. Exclusively in the training group, the k-fold Cross-Validation technique was applied, adopting k = 5. To evaluate performance, the following metrics were calculated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Normalized Root Mean Square Error (NRMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²). Subsequently, in the best-performing model, the SHAP function was applied to select the most relevant variables for constructing two reduced versions: Reduced 1 (containing only the five most relevant variables based on SHAP calculation – sex, WC, HC, height, and weight, in order of importance) and Reduced 2 (including the variable age). The Bland-Altman plot and Intraclass Correlation Coefficient (ICC) were applied to evaluate the agreement between the %BF estimated by the models and the %BF measured by DXA. The frequency of overweight was 26.5% and obesity 14.1%, considering BMI. However, by %BF, 52.1% of individuals were classified as obese. Notably, normal-weight obese individuals corresponded to 17.3% of the general sample. All ML models yielded better results compared to LR, with the Radial SVM - Reduced 2 achieving the best performance and agreement (MAE = 2.98; RMSE = 3.72; NRMSE = 7.07; R² = 0.91, MAPE = 13.24% and ICC = 0.95 [95% CI: 0.947 – 0.957]). It is concluded that the developed ML models, using simple demographic and anthropometric variables, presented high performance and excellent agreement for estimating %BF in adults, being potentially applicable in epidemiological research and clinical settings, including those with scarce resources, especially in Primary Health Care. It is noteworthy that the Radial SVM – Reduced 2 model proved to be accurate and parsimonious, requiring a smaller number of variables for its application.
id UFMA_19ea351c7f2ab664ace8b2b5bc54fed2
oai_identifier_str oai:tede2:tede/6815
network_acronym_str UFMA
network_name_str Biblioteca Digital de Teses e Dissertações da UFMA
repository_id_str
spelling FRANÇA, Ana Karina Teixeira da Cunhahttp://lattes.cnpq.br/8389486900285691SANTOS, Alcione Miranda doshttp://lattes.cnpq.br/2709550775435326FRANÇA, Ana Karina Teixeira da Cunhahttp://lattes.cnpq.br/8389486900285691SANTOS, Alcione Miranda doshttp://lattes.cnpq.br/2709550775435326SOUSA JÚNIOR, Carlos Magnohttp://lattes.cnpq.br/9561853644051629NASCIMENTO, Joelma Ximenes Prado Teixeirahttp://lattes.cnpq.br/9553463144721017BOGÉA, Eduarda Gomeshttp://lattes.cnpq.br/6613731210897851http://lattes.cnpq.br/3636552632794234SANTOS, Heloísa Baima da Silva2026-03-03T18:46:21Z2026-02-11SANTOS, Heloísa Baima da Silva. Uso de técnicas de aprendizado de máquina para estimação de gordura corporal em adultos brasileiros. 2026. 17 f. Dissertação( Programa de Pós-graduação em Saúde Coletiva/CCBS) - Universidade Federal do Maranhão, São Luís, 2026.https://tedebc.ufma.br/jspui/handle/tede/6815Considering the rising prevalence of obesity in Brazil and worldwide, the limitations of Body Mass Index (BMI) in differentiating lean mass from fat mass, as well as the absence of a precise and accessible method to assess body fat percentage (%BF), this study aimed to develop models capable of estimating %BF in adults, using demographic data and simple anthropometric measures, through Machine Learning (ML) techniques. All statistical analyses and model construction were performed using the R® programming language. The study population consisted of 7,085 adults aged 22 and 30 years, belonging to the 1993 and 1982 Pelotas-RS birth cohorts, respectively. Participants' BMI values were calculated and classified according to the World Health Organization. Individuals were further categorized based on %BF values into obese (≥ 25% for men and ≥ 32% for women) and non-obese (≤ 25% for men and ≤ 32% for women). After BMI and %BF classification, individuals with BMI < 25 kg/m² were also grouped into normal weight (BMI < 25 kg/m² and absence of obesity by %BF) and normalweight obese (BMI < 25 kg/m² and presence of obesity by %BF). Input variables considered were demographic data (sex and age) and anthropometric measures (weight, height, and waist (WC), hip (HC), right wrist, and right calf circumferences). The outcome was defined as the %BF measured by Dual-Energy X-ray Absorptiometry (DXA). For model construction, the Radial Support Vector Machine (SVM) algorithm was applied, adopting Linear Regression (LR) as a baseline. Data partitioning followed the Hold-Out method, with 80% comprising the training set and 20% the test set. Exclusively in the training group, the k-fold Cross-Validation technique was applied, adopting k = 5. To evaluate performance, the following metrics were calculated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Normalized Root Mean Square Error (NRMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²). Subsequently, in the best-performing model, the SHAP function was applied to select the most relevant variables for constructing two reduced versions: Reduced 1 (containing only the five most relevant variables based on SHAP calculation – sex, WC, HC, height, and weight, in order of importance) and Reduced 2 (including the variable age). The Bland-Altman plot and Intraclass Correlation Coefficient (ICC) were applied to evaluate the agreement between the %BF estimated by the models and the %BF measured by DXA. The frequency of overweight was 26.5% and obesity 14.1%, considering BMI. However, by %BF, 52.1% of individuals were classified as obese. Notably, normal-weight obese individuals corresponded to 17.3% of the general sample. All ML models yielded better results compared to LR, with the Radial SVM - Reduced 2 achieving the best performance and agreement (MAE = 2.98; RMSE = 3.72; NRMSE = 7.07; R² = 0.91, MAPE = 13.24% and ICC = 0.95 [95% CI: 0.947 – 0.957]). It is concluded that the developed ML models, using simple demographic and anthropometric variables, presented high performance and excellent agreement for estimating %BF in adults, being potentially applicable in epidemiological research and clinical settings, including those with scarce resources, especially in Primary Health Care. It is noteworthy that the Radial SVM – Reduced 2 model proved to be accurate and parsimonious, requiring a smaller number of variables for its application.Considerando a frequência crescente de obesidade no Brasil e no mundo, as limitações do Índice de Massa Corporal (IMC) em diferenciar massa magra de massa gorda, bem como a ausência de um método para avaliar o percentual de gordura corporal (%GC) de forma precisa e acessível, este estudo propôs- se a desenvolver modelos capazes de estimar o %GC de adultos, a partir de dados demográficos e medidas antropométricas simples, através de técnicas de Aprendizado de Máquina (AM). Todas as análises estatísticas e a construção dos modelos foram realizadas na linguagem de programação R®. A população do estudo foi constituída por 7.085 adultos com 22 e 30 anos, pertencentes às coortes de Pelotas-RS de 1993 e de 1982, respectivamente. Os valores do IMC dos participantes do estudo foram calculados e classificados conforme a Organização Mundial da Saúde. Os indivíduos foram categorizados, ainda, com base nos valores de %GC, em obesos (≥ 25% para homens e ≥ 32% para mulheres) e não obesos (≤ 25% para homens e ≤ 32% para mulheres). Após classificação do IMC e %GC, os indivíduos com IMC < 25 kg/m² também foram agrupados em peso normal (IMC < 25kg/m² e ausência de obesidade pelo %GC) e obeso de peso normal (IMC < 25kg/m² e presença de obesidade pelo %GC). Foram consideradas variáveis de entrada: dados demográficos (sexo e idade) e medidas antropométricas (peso, altura e circunferências da cintura (CC), quadril (CQ), punho direito e panturrilha direita). Enquanto o desfecho foi considerado o %GC mensurado pelo método da Absortometria de Raio-X de Dupla Energia (DXA). Para construção dos modelos, foi aplicado o algoritmo Máquina de Vetor de Suporte (Support Vector Machine - SVM) Radial, adotando-se a Regressão Linear (RL) como referência (baseline). A partição dos dados seguiu o método Hold-Out, em que 80% compuseram o conjunto de treino e 20% o conjunto de teste. Exclusivamente no grupo de treino, aplicou-se a técnica de validação cruzada com k partições (k-fold Cross Validation), adotando-se k = 5. Para avaliação do desempenho, foram calculadas as métricas: Erro Médio Absoluto (MAE), Raiz do Erro Quadrático Médio (RMSE), Raiz do Erro Quadrático Médio Normalizado (NRMSE), Erro Percentual Absoluto Médio (MAPE) e Coeficiente de Determinação (R²). Posteriormente, no modelo com melhor desempenho, aplicou-se a função SHAP para seleção das variáveis de maior relevância para construção de duas versões reduzidas: Reduzido 1 (somente com as cinco variáveis mais relevantes com base no cálculo de SHAP - sexo, CC, CQ, altura e peso, em ordem de importância) e Reduzido 2 (incluindo a variável idade). Aplicou-se o gráfico de Bland-Altman e o Coeficiente de Correlação Intraclasse (CCI) para avaliação da concordância entre o %GC estimado pelos modelos e o %GC mensurado pela DXA. A frequência de indivíduos com sobrepeso foi de 26,5% e de obesidade 14,1%, considerando o IMC. Já pelo %GC, 52,1% dos indivíduos foram classificados com obesidade. Destaca-se que, indivíduos obesos de peso normal corresponderam a 17,3% da amostra geral. Todos os modelos de AM tiveram melhores resultados quando comparados à RL, sendo o SVM Radial - Reduzido 2 o que obteve melhor desempenho e concordância (MAE = 2,98; RMSE = 3,72; NRMSE = 7,07; R² = 0,91, MAPE = 13,24% e CCI = 0,95 [IC 95%: 0,947 – 0,957]). Conclui-se que os modelos de AM desenvolvidos, utilizando variáveis demográficas e antropométricas simples, apresentaram alto desempenho e excelente concordância para estimação do %GC de adultos, sendo potencialmente aplicáveis em pesquisas epidemiológicas e em ambientes clínicos, inclusive naqueles com escassez de recursos, especialmente na Atenção Primária à Saúde. Ressalta-se que o modelo SVM Radial – Reduzido 2 mostrou-se preciso e parcimonioso, necessitando de um menor número de variáveis para sua aplicação.Submitted by Maria Aparecida (cidazen@gmail.com) on 2026-03-03T18:46:21Z No. of bitstreams: 1 HELOÍSA BAIMA DA SILVA SANTOS.pdf: 366832 bytes, checksum: 4618895d0442156d41db7d5b76fbdf9a (MD5)Made available in DSpace on 2026-03-03T18:46:21Z (GMT). No. of bitstreams: 1 HELOÍSA BAIMA DA SILVA SANTOS.pdf: 366832 bytes, checksum: 4618895d0442156d41db7d5b76fbdf9a (MD5) Previous issue date: 2026-02-11CAPESapplication/pdfporUniversidade Federal do MaranhãoPROGRAMA DE PÓS-GRADUAÇÃO EM SAÚDE COLETIVA/CCBSUFMABrasilCOORDENAÇÃO DO CURSO DE NUTRIÇÃO/CCBScomposição corporal;aprendizado de máquina;algoritmos;antropometriabody composition;machine learning;algorithms;anthropometryAnálise Nutricional de PopulaçãoUSO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROSUse of Machine Learning Techniques for Estimating Body Fat in Brazilian AdultsTrabalho sob Sigilo. Motivo: Em processo de publicação em periódico. Prazo para Liberação:12 meses.info:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da UFMAinstname:Universidade Federal do Maranhão (UFMA)instacron:UFMAORIGINALHELOÍSA BAIMA DA SILVA SANTOS.pdfHELOÍSA BAIMA DA SILVA SANTOS.pdfapplication/pdf366832http://tedebc.ufma.br:8080/bitstream/tede/6815/2/HELO%C3%8DSA+BAIMA+DA+SILVA+SANTOS.pdf4618895d0442156d41db7d5b76fbdf9aMD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82255http://tedebc.ufma.br:8080/bitstream/tede/6815/1/license.txt97eeade1fce43278e63fe063657f8083MD51tede/68152026-03-03 15:48:57.748oai:tede2:tede/6815IExJQ0VOw4dBIERFIERJU1RSSUJVScOHw4NPIE7Dg08tRVhDTFVTSVZBCgpDb20gYSBhcHJlc2VudGHDp8OjbyBkZXN0YSBsaWNlbsOnYSxvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvciBjb25jZWRlIMOgIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRvIE1hcmFuaMOjbyAoVUZNQSkgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsIHRyYWR1emlyIChjb25mb3JtZSBkZWZpbmlkbyBhYmFpeG8pLCBlL291IGRpc3RyaWJ1aXIgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyw7RuaWNvIGUgZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBjb25jb3JkYSBxdWUgYSBVRk1BIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhIFVGTUEgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gcGFyYSBmaW5zIGRlIHNlZ3VyYW7Dp2EsIGJhY2stdXAgZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogZGVjbGFyYSBxdWUgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIMOpIG9yaWdpbmFsIGUgcXVlIHZvY8OqIHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIG8gZGVww7NzaXRvIGRhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gbsOjbywgcXVlIHNlamEgZGUgc2V1IGNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiBkZWNsYXJhIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgw6AgVUZNQSBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBURVNFIE9VIERJU1NFUlRBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PIFFVRSBOw4NPIFNFSkEgQSBVRk1BLCBWT0PDiiBERUNMQVJBIFFVRSBSRVNQRUlUT1UgVE9ET1MgRSBRVUFJU1FVRVIgRElSRUlUT1MgREUgUkVWSVPDg08gQ09NTyBUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCkEgVUZNQSBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lIG91IG8ocykgbm9tZShzKSBkbyhzKSBkZXRlbnRvcihlcykgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbywgZSBuw6NvIGZhcsOhIHF1YWxxdWVyIGFsdGVyYcOnw6NvLCBhbMOpbSBkYXF1ZWxhcyBjb25jZWRpZGFzIHBvciBlc3RhIGxpY2Vuw6dhLgoKRGVjbGFyYSB0YW1iw6ltIHF1ZSB0b2RhcyBhcyBhZmlsaWHDp8O1ZXMgY29ycG9yYXRpdmFzIG91IGluc3RpdHVjaW9uYWlzIGUgdG9kYXMgYXMgZm9udGVzIGRlIGFwb2lvIGZpbmFuY2Vpcm8gYW8gdHJhYmFsaG8gZXN0w6NvIGRldmlkYW1lbnRlIGNpdGFkYXMgb3UgbWVuY2lvbmFkYXMgZSBjZXJ0aWZpY2EgcXVlIG7Do28gaMOhIG5lbmh1bSBpbnRlcmVzc2UgY29tZXJjaWFsIG91IGFzc29jaWF0aXZvIHF1ZSByZXByZXNlbnRlIGNvbmZsaXRvIGRlIGludGVyZXNzZSBlbSBjb25leMOjbyBjb20gbyB0cmFiYWxobyBzdWJtZXRpZG8uCgoKCgoKCgo=Biblioteca Digital de Teses e Dissertaçõeshttps://tedebc.ufma.br/jspui/PUBhttp://tedebc.ufma.br:8080/oai/requestrepositorio@ufma.bropendoar:21312026-03-03T18:48:57Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA)false
dc.title.por.fl_str_mv USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
dc.title.alternative.eng.fl_str_mv Use of Machine Learning Techniques for Estimating Body Fat in Brazilian Adults
title USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
spellingShingle USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
SANTOS, Heloísa Baima da Silva
composição corporal;
aprendizado de máquina;
algoritmos;
antropometria
body composition;
machine learning;
algorithms;
anthropometry
Análise Nutricional de População
title_short USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
title_full USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
title_fullStr USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
title_full_unstemmed USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
title_sort USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
author SANTOS, Heloísa Baima da Silva
author_facet SANTOS, Heloísa Baima da Silva
author_role author
dc.contributor.advisor1.fl_str_mv FRANÇA, Ana Karina Teixeira da Cunha
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/8389486900285691
dc.contributor.advisor-co1.fl_str_mv SANTOS, Alcione Miranda dos
dc.contributor.advisor-co1Lattes.fl_str_mv http://lattes.cnpq.br/2709550775435326
dc.contributor.referee1.fl_str_mv FRANÇA, Ana Karina Teixeira da Cunha
dc.contributor.referee1Lattes.fl_str_mv http://lattes.cnpq.br/8389486900285691
dc.contributor.referee2.fl_str_mv SANTOS, Alcione Miranda dos
dc.contributor.referee2Lattes.fl_str_mv http://lattes.cnpq.br/2709550775435326
dc.contributor.referee3.fl_str_mv SOUSA JÚNIOR, Carlos Magno
dc.contributor.referee3Lattes.fl_str_mv http://lattes.cnpq.br/9561853644051629
dc.contributor.referee4.fl_str_mv NASCIMENTO, Joelma Ximenes Prado Teixeira
dc.contributor.referee4Lattes.fl_str_mv http://lattes.cnpq.br/9553463144721017
dc.contributor.referee5.fl_str_mv BOGÉA, Eduarda Gomes
dc.contributor.referee5Lattes.fl_str_mv http://lattes.cnpq.br/6613731210897851
dc.contributor.authorLattes.fl_str_mv http://lattes.cnpq.br/3636552632794234
dc.contributor.author.fl_str_mv SANTOS, Heloísa Baima da Silva
contributor_str_mv FRANÇA, Ana Karina Teixeira da Cunha
SANTOS, Alcione Miranda dos
FRANÇA, Ana Karina Teixeira da Cunha
SANTOS, Alcione Miranda dos
SOUSA JÚNIOR, Carlos Magno
NASCIMENTO, Joelma Ximenes Prado Teixeira
BOGÉA, Eduarda Gomes
dc.subject.por.fl_str_mv composição corporal;
aprendizado de máquina;
algoritmos;
antropometria
topic composição corporal;
aprendizado de máquina;
algoritmos;
antropometria
body composition;
machine learning;
algorithms;
anthropometry
Análise Nutricional de População
dc.subject.eng.fl_str_mv body composition;
machine learning;
algorithms;
anthropometry
dc.subject.cnpq.fl_str_mv Análise Nutricional de População
description Considering the rising prevalence of obesity in Brazil and worldwide, the limitations of Body Mass Index (BMI) in differentiating lean mass from fat mass, as well as the absence of a precise and accessible method to assess body fat percentage (%BF), this study aimed to develop models capable of estimating %BF in adults, using demographic data and simple anthropometric measures, through Machine Learning (ML) techniques. All statistical analyses and model construction were performed using the R® programming language. The study population consisted of 7,085 adults aged 22 and 30 years, belonging to the 1993 and 1982 Pelotas-RS birth cohorts, respectively. Participants' BMI values were calculated and classified according to the World Health Organization. Individuals were further categorized based on %BF values into obese (≥ 25% for men and ≥ 32% for women) and non-obese (≤ 25% for men and ≤ 32% for women). After BMI and %BF classification, individuals with BMI < 25 kg/m² were also grouped into normal weight (BMI < 25 kg/m² and absence of obesity by %BF) and normalweight obese (BMI < 25 kg/m² and presence of obesity by %BF). Input variables considered were demographic data (sex and age) and anthropometric measures (weight, height, and waist (WC), hip (HC), right wrist, and right calf circumferences). The outcome was defined as the %BF measured by Dual-Energy X-ray Absorptiometry (DXA). For model construction, the Radial Support Vector Machine (SVM) algorithm was applied, adopting Linear Regression (LR) as a baseline. Data partitioning followed the Hold-Out method, with 80% comprising the training set and 20% the test set. Exclusively in the training group, the k-fold Cross-Validation technique was applied, adopting k = 5. To evaluate performance, the following metrics were calculated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Normalized Root Mean Square Error (NRMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²). Subsequently, in the best-performing model, the SHAP function was applied to select the most relevant variables for constructing two reduced versions: Reduced 1 (containing only the five most relevant variables based on SHAP calculation – sex, WC, HC, height, and weight, in order of importance) and Reduced 2 (including the variable age). The Bland-Altman plot and Intraclass Correlation Coefficient (ICC) were applied to evaluate the agreement between the %BF estimated by the models and the %BF measured by DXA. The frequency of overweight was 26.5% and obesity 14.1%, considering BMI. However, by %BF, 52.1% of individuals were classified as obese. Notably, normal-weight obese individuals corresponded to 17.3% of the general sample. All ML models yielded better results compared to LR, with the Radial SVM - Reduced 2 achieving the best performance and agreement (MAE = 2.98; RMSE = 3.72; NRMSE = 7.07; R² = 0.91, MAPE = 13.24% and ICC = 0.95 [95% CI: 0.947 – 0.957]). It is concluded that the developed ML models, using simple demographic and anthropometric variables, presented high performance and excellent agreement for estimating %BF in adults, being potentially applicable in epidemiological research and clinical settings, including those with scarce resources, especially in Primary Health Care. It is noteworthy that the Radial SVM – Reduced 2 model proved to be accurate and parsimonious, requiring a smaller number of variables for its application.
publishDate 2026
dc.date.accessioned.fl_str_mv 2026-03-03T18:46:21Z
dc.date.issued.fl_str_mv 2026-02-11
dc.type.driver.fl_str_mv Trabalho sob Sigilo. Motivo: Em processo de publicação em periódico. Prazo para Liberação:12 meses.
info:eu-repo/semantics/masterThesis
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv SANTOS, Heloísa Baima da Silva. Uso de técnicas de aprendizado de máquina para estimação de gordura corporal em adultos brasileiros. 2026. 17 f. Dissertação( Programa de Pós-graduação em Saúde Coletiva/CCBS) - Universidade Federal do Maranhão, São Luís, 2026.
dc.identifier.uri.fl_str_mv https://tedebc.ufma.br/jspui/handle/tede/6815
identifier_str_mv SANTOS, Heloísa Baima da Silva. Uso de técnicas de aprendizado de máquina para estimação de gordura corporal em adultos brasileiros. 2026. 17 f. Dissertação( Programa de Pós-graduação em Saúde Coletiva/CCBS) - Universidade Federal do Maranhão, São Luís, 2026.
url https://tedebc.ufma.br/jspui/handle/tede/6815
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal do Maranhão
dc.publisher.program.fl_str_mv PROGRAMA DE PÓS-GRADUAÇÃO EM SAÚDE COLETIVA/CCBS
dc.publisher.initials.fl_str_mv UFMA
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv COORDENAÇÃO DO CURSO DE NUTRIÇÃO/CCBS
publisher.none.fl_str_mv Universidade Federal do Maranhão
dc.source.none.fl_str_mv reponame:Biblioteca Digital de Teses e Dissertações da UFMA
instname:Universidade Federal do Maranhão (UFMA)
instacron:UFMA
instname_str Universidade Federal do Maranhão (UFMA)
instacron_str UFMA
institution UFMA
reponame_str Biblioteca Digital de Teses e Dissertações da UFMA
collection Biblioteca Digital de Teses e Dissertações da UFMA
bitstream.url.fl_str_mv http://tedebc.ufma.br:8080/bitstream/tede/6815/2/HELO%C3%8DSA+BAIMA+DA+SILVA+SANTOS.pdf
http://tedebc.ufma.br:8080/bitstream/tede/6815/1/license.txt
bitstream.checksum.fl_str_mv 4618895d0442156d41db7d5b76fbdf9a
97eeade1fce43278e63fe063657f8083
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA)
repository.mail.fl_str_mv repositorio@ufma.br
_version_ 1863373532582379520