USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS
| Ano de defesa: | 2026 |
|---|---|
| Autor(a) principal: | |
| Orientador(a): | |
| Banca de defesa: | , , , , |
| Tipo de documento: | Dissertação |
| Tipo de acesso: | Acesso aberto |
| Idioma: | por |
| Instituição de defesa: |
Universidade Federal do Maranhão
|
| Programa de Pós-Graduação: |
PROGRAMA DE PÓS-GRADUAÇÃO EM SAÚDE COLETIVA/CCBS
|
| Departamento: |
COORDENAÇÃO DO CURSO DE NUTRIÇÃO/CCBS
|
| País: |
Brasil
|
| Palavras-chave em Português: | |
| Palavras-chave em Inglês: | |
| Área do conhecimento CNPq: | |
| Link de acesso: | https://tedebc.ufma.br/jspui/handle/tede/6815 |
Resumo: | Considering the rising prevalence of obesity in Brazil and worldwide, the limitations of Body Mass Index (BMI) in differentiating lean mass from fat mass, as well as the absence of a precise and accessible method to assess body fat percentage (%BF), this study aimed to develop models capable of estimating %BF in adults, using demographic data and simple anthropometric measures, through Machine Learning (ML) techniques. All statistical analyses and model construction were performed using the R® programming language. The study population consisted of 7,085 adults aged 22 and 30 years, belonging to the 1993 and 1982 Pelotas-RS birth cohorts, respectively. Participants' BMI values were calculated and classified according to the World Health Organization. Individuals were further categorized based on %BF values into obese (≥ 25% for men and ≥ 32% for women) and non-obese (≤ 25% for men and ≤ 32% for women). After BMI and %BF classification, individuals with BMI < 25 kg/m² were also grouped into normal weight (BMI < 25 kg/m² and absence of obesity by %BF) and normalweight obese (BMI < 25 kg/m² and presence of obesity by %BF). Input variables considered were demographic data (sex and age) and anthropometric measures (weight, height, and waist (WC), hip (HC), right wrist, and right calf circumferences). The outcome was defined as the %BF measured by Dual-Energy X-ray Absorptiometry (DXA). For model construction, the Radial Support Vector Machine (SVM) algorithm was applied, adopting Linear Regression (LR) as a baseline. Data partitioning followed the Hold-Out method, with 80% comprising the training set and 20% the test set. Exclusively in the training group, the k-fold Cross-Validation technique was applied, adopting k = 5. To evaluate performance, the following metrics were calculated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Normalized Root Mean Square Error (NRMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²). Subsequently, in the best-performing model, the SHAP function was applied to select the most relevant variables for constructing two reduced versions: Reduced 1 (containing only the five most relevant variables based on SHAP calculation – sex, WC, HC, height, and weight, in order of importance) and Reduced 2 (including the variable age). The Bland-Altman plot and Intraclass Correlation Coefficient (ICC) were applied to evaluate the agreement between the %BF estimated by the models and the %BF measured by DXA. The frequency of overweight was 26.5% and obesity 14.1%, considering BMI. However, by %BF, 52.1% of individuals were classified as obese. Notably, normal-weight obese individuals corresponded to 17.3% of the general sample. All ML models yielded better results compared to LR, with the Radial SVM - Reduced 2 achieving the best performance and agreement (MAE = 2.98; RMSE = 3.72; NRMSE = 7.07; R² = 0.91, MAPE = 13.24% and ICC = 0.95 [95% CI: 0.947 – 0.957]). It is concluded that the developed ML models, using simple demographic and anthropometric variables, presented high performance and excellent agreement for estimating %BF in adults, being potentially applicable in epidemiological research and clinical settings, including those with scarce resources, especially in Primary Health Care. It is noteworthy that the Radial SVM – Reduced 2 model proved to be accurate and parsimonious, requiring a smaller number of variables for its application. |
| id |
UFMA_19ea351c7f2ab664ace8b2b5bc54fed2 |
|---|---|
| oai_identifier_str |
oai:tede2:tede/6815 |
| network_acronym_str |
UFMA |
| network_name_str |
Biblioteca Digital de Teses e Dissertações da UFMA |
| repository_id_str |
|
| spelling |
FRANÇA, Ana Karina Teixeira da Cunhahttp://lattes.cnpq.br/8389486900285691SANTOS, Alcione Miranda doshttp://lattes.cnpq.br/2709550775435326FRANÇA, Ana Karina Teixeira da Cunhahttp://lattes.cnpq.br/8389486900285691SANTOS, Alcione Miranda doshttp://lattes.cnpq.br/2709550775435326SOUSA JÚNIOR, Carlos Magnohttp://lattes.cnpq.br/9561853644051629NASCIMENTO, Joelma Ximenes Prado Teixeirahttp://lattes.cnpq.br/9553463144721017BOGÉA, Eduarda Gomeshttp://lattes.cnpq.br/6613731210897851http://lattes.cnpq.br/3636552632794234SANTOS, Heloísa Baima da Silva2026-03-03T18:46:21Z2026-02-11SANTOS, Heloísa Baima da Silva. Uso de técnicas de aprendizado de máquina para estimação de gordura corporal em adultos brasileiros. 2026. 17 f. Dissertação( Programa de Pós-graduação em Saúde Coletiva/CCBS) - Universidade Federal do Maranhão, São Luís, 2026.https://tedebc.ufma.br/jspui/handle/tede/6815Considering the rising prevalence of obesity in Brazil and worldwide, the limitations of Body Mass Index (BMI) in differentiating lean mass from fat mass, as well as the absence of a precise and accessible method to assess body fat percentage (%BF), this study aimed to develop models capable of estimating %BF in adults, using demographic data and simple anthropometric measures, through Machine Learning (ML) techniques. All statistical analyses and model construction were performed using the R® programming language. The study population consisted of 7,085 adults aged 22 and 30 years, belonging to the 1993 and 1982 Pelotas-RS birth cohorts, respectively. Participants' BMI values were calculated and classified according to the World Health Organization. Individuals were further categorized based on %BF values into obese (≥ 25% for men and ≥ 32% for women) and non-obese (≤ 25% for men and ≤ 32% for women). After BMI and %BF classification, individuals with BMI < 25 kg/m² were also grouped into normal weight (BMI < 25 kg/m² and absence of obesity by %BF) and normalweight obese (BMI < 25 kg/m² and presence of obesity by %BF). Input variables considered were demographic data (sex and age) and anthropometric measures (weight, height, and waist (WC), hip (HC), right wrist, and right calf circumferences). The outcome was defined as the %BF measured by Dual-Energy X-ray Absorptiometry (DXA). For model construction, the Radial Support Vector Machine (SVM) algorithm was applied, adopting Linear Regression (LR) as a baseline. Data partitioning followed the Hold-Out method, with 80% comprising the training set and 20% the test set. Exclusively in the training group, the k-fold Cross-Validation technique was applied, adopting k = 5. To evaluate performance, the following metrics were calculated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Normalized Root Mean Square Error (NRMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²). Subsequently, in the best-performing model, the SHAP function was applied to select the most relevant variables for constructing two reduced versions: Reduced 1 (containing only the five most relevant variables based on SHAP calculation – sex, WC, HC, height, and weight, in order of importance) and Reduced 2 (including the variable age). The Bland-Altman plot and Intraclass Correlation Coefficient (ICC) were applied to evaluate the agreement between the %BF estimated by the models and the %BF measured by DXA. The frequency of overweight was 26.5% and obesity 14.1%, considering BMI. However, by %BF, 52.1% of individuals were classified as obese. Notably, normal-weight obese individuals corresponded to 17.3% of the general sample. All ML models yielded better results compared to LR, with the Radial SVM - Reduced 2 achieving the best performance and agreement (MAE = 2.98; RMSE = 3.72; NRMSE = 7.07; R² = 0.91, MAPE = 13.24% and ICC = 0.95 [95% CI: 0.947 – 0.957]). It is concluded that the developed ML models, using simple demographic and anthropometric variables, presented high performance and excellent agreement for estimating %BF in adults, being potentially applicable in epidemiological research and clinical settings, including those with scarce resources, especially in Primary Health Care. It is noteworthy that the Radial SVM – Reduced 2 model proved to be accurate and parsimonious, requiring a smaller number of variables for its application.Considerando a frequência crescente de obesidade no Brasil e no mundo, as limitações do Índice de Massa Corporal (IMC) em diferenciar massa magra de massa gorda, bem como a ausência de um método para avaliar o percentual de gordura corporal (%GC) de forma precisa e acessível, este estudo propôs- se a desenvolver modelos capazes de estimar o %GC de adultos, a partir de dados demográficos e medidas antropométricas simples, através de técnicas de Aprendizado de Máquina (AM). Todas as análises estatísticas e a construção dos modelos foram realizadas na linguagem de programação R®. A população do estudo foi constituída por 7.085 adultos com 22 e 30 anos, pertencentes às coortes de Pelotas-RS de 1993 e de 1982, respectivamente. Os valores do IMC dos participantes do estudo foram calculados e classificados conforme a Organização Mundial da Saúde. Os indivíduos foram categorizados, ainda, com base nos valores de %GC, em obesos (≥ 25% para homens e ≥ 32% para mulheres) e não obesos (≤ 25% para homens e ≤ 32% para mulheres). Após classificação do IMC e %GC, os indivíduos com IMC < 25 kg/m² também foram agrupados em peso normal (IMC < 25kg/m² e ausência de obesidade pelo %GC) e obeso de peso normal (IMC < 25kg/m² e presença de obesidade pelo %GC). Foram consideradas variáveis de entrada: dados demográficos (sexo e idade) e medidas antropométricas (peso, altura e circunferências da cintura (CC), quadril (CQ), punho direito e panturrilha direita). Enquanto o desfecho foi considerado o %GC mensurado pelo método da Absortometria de Raio-X de Dupla Energia (DXA). Para construção dos modelos, foi aplicado o algoritmo Máquina de Vetor de Suporte (Support Vector Machine - SVM) Radial, adotando-se a Regressão Linear (RL) como referência (baseline). A partição dos dados seguiu o método Hold-Out, em que 80% compuseram o conjunto de treino e 20% o conjunto de teste. Exclusivamente no grupo de treino, aplicou-se a técnica de validação cruzada com k partições (k-fold Cross Validation), adotando-se k = 5. Para avaliação do desempenho, foram calculadas as métricas: Erro Médio Absoluto (MAE), Raiz do Erro Quadrático Médio (RMSE), Raiz do Erro Quadrático Médio Normalizado (NRMSE), Erro Percentual Absoluto Médio (MAPE) e Coeficiente de Determinação (R²). Posteriormente, no modelo com melhor desempenho, aplicou-se a função SHAP para seleção das variáveis de maior relevância para construção de duas versões reduzidas: Reduzido 1 (somente com as cinco variáveis mais relevantes com base no cálculo de SHAP - sexo, CC, CQ, altura e peso, em ordem de importância) e Reduzido 2 (incluindo a variável idade). Aplicou-se o gráfico de Bland-Altman e o Coeficiente de Correlação Intraclasse (CCI) para avaliação da concordância entre o %GC estimado pelos modelos e o %GC mensurado pela DXA. A frequência de indivíduos com sobrepeso foi de 26,5% e de obesidade 14,1%, considerando o IMC. Já pelo %GC, 52,1% dos indivíduos foram classificados com obesidade. Destaca-se que, indivíduos obesos de peso normal corresponderam a 17,3% da amostra geral. Todos os modelos de AM tiveram melhores resultados quando comparados à RL, sendo o SVM Radial - Reduzido 2 o que obteve melhor desempenho e concordância (MAE = 2,98; RMSE = 3,72; NRMSE = 7,07; R² = 0,91, MAPE = 13,24% e CCI = 0,95 [IC 95%: 0,947 – 0,957]). Conclui-se que os modelos de AM desenvolvidos, utilizando variáveis demográficas e antropométricas simples, apresentaram alto desempenho e excelente concordância para estimação do %GC de adultos, sendo potencialmente aplicáveis em pesquisas epidemiológicas e em ambientes clínicos, inclusive naqueles com escassez de recursos, especialmente na Atenção Primária à Saúde. Ressalta-se que o modelo SVM Radial – Reduzido 2 mostrou-se preciso e parcimonioso, necessitando de um menor número de variáveis para sua aplicação.Submitted by Maria Aparecida (cidazen@gmail.com) on 2026-03-03T18:46:21Z No. of bitstreams: 1 HELOÍSA BAIMA DA SILVA SANTOS.pdf: 366832 bytes, checksum: 4618895d0442156d41db7d5b76fbdf9a (MD5)Made available in DSpace on 2026-03-03T18:46:21Z (GMT). No. of bitstreams: 1 HELOÍSA BAIMA DA SILVA SANTOS.pdf: 366832 bytes, checksum: 4618895d0442156d41db7d5b76fbdf9a (MD5) Previous issue date: 2026-02-11CAPESapplication/pdfporUniversidade Federal do MaranhãoPROGRAMA DE PÓS-GRADUAÇÃO EM SAÚDE COLETIVA/CCBSUFMABrasilCOORDENAÇÃO DO CURSO DE NUTRIÇÃO/CCBScomposição corporal;aprendizado de máquina;algoritmos;antropometriabody composition;machine learning;algorithms;anthropometryAnálise Nutricional de PopulaçãoUSO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROSUse of Machine Learning Techniques for Estimating Body Fat in Brazilian AdultsTrabalho sob Sigilo. Motivo: Em processo de publicação em periódico. Prazo para Liberação:12 meses.info:eu-repo/semantics/masterThesisinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da UFMAinstname:Universidade Federal do Maranhão (UFMA)instacron:UFMAORIGINALHELOÍSA BAIMA DA SILVA SANTOS.pdfHELOÍSA BAIMA DA SILVA SANTOS.pdfapplication/pdf366832http://tedebc.ufma.br:8080/bitstream/tede/6815/2/HELO%C3%8DSA+BAIMA+DA+SILVA+SANTOS.pdf4618895d0442156d41db7d5b76fbdf9aMD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82255http://tedebc.ufma.br:8080/bitstream/tede/6815/1/license.txt97eeade1fce43278e63fe063657f8083MD51tede/68152026-03-03 15:48:57.748oai:tede2:tede/6815IExJQ0VOw4dBIERFIERJU1RSSUJVScOHw4NPIE7Dg08tRVhDTFVTSVZBCgpDb20gYSBhcHJlc2VudGHDp8OjbyBkZXN0YSBsaWNlbsOnYSxvIGF1dG9yIChlcykgb3UgbyB0aXR1bGFyIGRvcyBkaXJlaXRvcyBkZSBhdXRvciBjb25jZWRlIMOgIFVuaXZlcnNpZGFkZSBGZWRlcmFsIGRvIE1hcmFuaMOjbyAoVUZNQSkgbyBkaXJlaXRvIG7Do28tZXhjbHVzaXZvIGRlIHJlcHJvZHV6aXIsIHRyYWR1emlyIChjb25mb3JtZSBkZWZpbmlkbyBhYmFpeG8pLCBlL291IGRpc3RyaWJ1aXIgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIChpbmNsdWluZG8gbyByZXN1bW8pIHBvciB0b2RvIG8gbXVuZG8gbm8gZm9ybWF0byBpbXByZXNzbyBlIGVsZXRyw7RuaWNvIGUgZW0gcXVhbHF1ZXIgbWVpbywgaW5jbHVpbmRvIG9zIGZvcm1hdG9zIMOhdWRpbyBvdSB2w61kZW8uCgpWb2PDqiBjb25jb3JkYSBxdWUgYSBVRk1BIHBvZGUsIHNlbSBhbHRlcmFyIG8gY29udGXDumRvLCB0cmFuc3BvciBhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gcGFyYSBxdWFscXVlciBtZWlvIG91IGZvcm1hdG8gcGFyYSBmaW5zIGRlIHByZXNlcnZhw6fDo28uCgpWb2PDqiB0YW1iw6ltIGNvbmNvcmRhIHF1ZSBhIFVGTUEgcG9kZSBtYW50ZXIgbWFpcyBkZSB1bWEgY8OzcGlhIGRlIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gcGFyYSBmaW5zIGRlIHNlZ3VyYW7Dp2EsIGJhY2stdXAgZSBwcmVzZXJ2YcOnw6NvLgoKVm9jw6ogZGVjbGFyYSBxdWUgYSBzdWEgdGVzZSBvdSBkaXNzZXJ0YcOnw6NvIMOpIG9yaWdpbmFsIGUgcXVlIHZvY8OqIHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vuw6dhLiBWb2PDqiB0YW1iw6ltIGRlY2xhcmEgcXVlIG8gZGVww7NzaXRvIGRhIHN1YSB0ZXNlIG91IGRpc3NlcnRhw6fDo28gbsOjbywgcXVlIHNlamEgZGUgc2V1IGNvbmhlY2ltZW50bywgaW5mcmluZ2UgZGlyZWl0b3MgYXV0b3JhaXMgZGUgbmluZ3XDqW0uCgpDYXNvIGEgc3VhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBjb250ZW5oYSBtYXRlcmlhbCBxdWUgdm9jw6ogbsOjbyBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2PDqiBkZWNsYXJhIHF1ZSBvYnRldmUgYSBwZXJtaXNzw6NvIGlycmVzdHJpdGEgZG8gZGV0ZW50b3IgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIHBhcmEgY29uY2VkZXIgw6AgVUZNQSBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7Dp2EsIGUgcXVlIGVzc2UgbWF0ZXJpYWwgZGUgcHJvcHJpZWRhZGUgZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3Ugbm8gY29udGXDumRvIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbyBvcmEgZGVwb3NpdGFkYS4KCkNBU08gQSBURVNFIE9VIERJU1NFUlRBw4fDg08gT1JBIERFUE9TSVRBREEgVEVOSEEgU0lETyBSRVNVTFRBRE8gREUgVU0gUEFUUk9Dw41OSU8gT1UgQVBPSU8gREUgVU1BIEFHw4pOQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PIFFVRSBOw4NPIFNFSkEgQSBVRk1BLCBWT0PDiiBERUNMQVJBIFFVRSBSRVNQRUlUT1UgVE9ET1MgRSBRVUFJU1FVRVIgRElSRUlUT1MgREUgUkVWSVPDg08gQ09NTyBUQU1Cw4lNIEFTIERFTUFJUyBPQlJJR0HDh8OVRVMgRVhJR0lEQVMgUE9SIENPTlRSQVRPIE9VIEFDT1JETy4KCkEgVUZNQSBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lIG91IG8ocykgbm9tZShzKSBkbyhzKSBkZXRlbnRvcihlcykgZG9zIGRpcmVpdG9zIGF1dG9yYWlzIGRhIHRlc2Ugb3UgZGlzc2VydGHDp8OjbywgZSBuw6NvIGZhcsOhIHF1YWxxdWVyIGFsdGVyYcOnw6NvLCBhbMOpbSBkYXF1ZWxhcyBjb25jZWRpZGFzIHBvciBlc3RhIGxpY2Vuw6dhLgoKRGVjbGFyYSB0YW1iw6ltIHF1ZSB0b2RhcyBhcyBhZmlsaWHDp8O1ZXMgY29ycG9yYXRpdmFzIG91IGluc3RpdHVjaW9uYWlzIGUgdG9kYXMgYXMgZm9udGVzIGRlIGFwb2lvIGZpbmFuY2Vpcm8gYW8gdHJhYmFsaG8gZXN0w6NvIGRldmlkYW1lbnRlIGNpdGFkYXMgb3UgbWVuY2lvbmFkYXMgZSBjZXJ0aWZpY2EgcXVlIG7Do28gaMOhIG5lbmh1bSBpbnRlcmVzc2UgY29tZXJjaWFsIG91IGFzc29jaWF0aXZvIHF1ZSByZXByZXNlbnRlIGNvbmZsaXRvIGRlIGludGVyZXNzZSBlbSBjb25leMOjbyBjb20gbyB0cmFiYWxobyBzdWJtZXRpZG8uCgoKCgoKCgo=Biblioteca Digital de Teses e Dissertaçõeshttps://tedebc.ufma.br/jspui/PUBhttp://tedebc.ufma.br:8080/oai/requestrepositorio@ufma.bropendoar:21312026-03-03T18:48:57Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA)false |
| dc.title.por.fl_str_mv |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| dc.title.alternative.eng.fl_str_mv |
Use of Machine Learning Techniques for Estimating Body Fat in Brazilian Adults |
| title |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| spellingShingle |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS SANTOS, Heloísa Baima da Silva composição corporal; aprendizado de máquina; algoritmos; antropometria body composition; machine learning; algorithms; anthropometry Análise Nutricional de População |
| title_short |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| title_full |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| title_fullStr |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| title_full_unstemmed |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| title_sort |
USO DE TÉCNICAS DE APRENDIZADO DE MÁQUINA PARA ESTIMAÇÃO DE GORDURA CORPORAL EM ADULTOS BRASILEIROS |
| author |
SANTOS, Heloísa Baima da Silva |
| author_facet |
SANTOS, Heloísa Baima da Silva |
| author_role |
author |
| dc.contributor.advisor1.fl_str_mv |
FRANÇA, Ana Karina Teixeira da Cunha |
| dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/8389486900285691 |
| dc.contributor.advisor-co1.fl_str_mv |
SANTOS, Alcione Miranda dos |
| dc.contributor.advisor-co1Lattes.fl_str_mv |
http://lattes.cnpq.br/2709550775435326 |
| dc.contributor.referee1.fl_str_mv |
FRANÇA, Ana Karina Teixeira da Cunha |
| dc.contributor.referee1Lattes.fl_str_mv |
http://lattes.cnpq.br/8389486900285691 |
| dc.contributor.referee2.fl_str_mv |
SANTOS, Alcione Miranda dos |
| dc.contributor.referee2Lattes.fl_str_mv |
http://lattes.cnpq.br/2709550775435326 |
| dc.contributor.referee3.fl_str_mv |
SOUSA JÚNIOR, Carlos Magno |
| dc.contributor.referee3Lattes.fl_str_mv |
http://lattes.cnpq.br/9561853644051629 |
| dc.contributor.referee4.fl_str_mv |
NASCIMENTO, Joelma Ximenes Prado Teixeira |
| dc.contributor.referee4Lattes.fl_str_mv |
http://lattes.cnpq.br/9553463144721017 |
| dc.contributor.referee5.fl_str_mv |
BOGÉA, Eduarda Gomes |
| dc.contributor.referee5Lattes.fl_str_mv |
http://lattes.cnpq.br/6613731210897851 |
| dc.contributor.authorLattes.fl_str_mv |
http://lattes.cnpq.br/3636552632794234 |
| dc.contributor.author.fl_str_mv |
SANTOS, Heloísa Baima da Silva |
| contributor_str_mv |
FRANÇA, Ana Karina Teixeira da Cunha SANTOS, Alcione Miranda dos FRANÇA, Ana Karina Teixeira da Cunha SANTOS, Alcione Miranda dos SOUSA JÚNIOR, Carlos Magno NASCIMENTO, Joelma Ximenes Prado Teixeira BOGÉA, Eduarda Gomes |
| dc.subject.por.fl_str_mv |
composição corporal; aprendizado de máquina; algoritmos; antropometria |
| topic |
composição corporal; aprendizado de máquina; algoritmos; antropometria body composition; machine learning; algorithms; anthropometry Análise Nutricional de População |
| dc.subject.eng.fl_str_mv |
body composition; machine learning; algorithms; anthropometry |
| dc.subject.cnpq.fl_str_mv |
Análise Nutricional de População |
| description |
Considering the rising prevalence of obesity in Brazil and worldwide, the limitations of Body Mass Index (BMI) in differentiating lean mass from fat mass, as well as the absence of a precise and accessible method to assess body fat percentage (%BF), this study aimed to develop models capable of estimating %BF in adults, using demographic data and simple anthropometric measures, through Machine Learning (ML) techniques. All statistical analyses and model construction were performed using the R® programming language. The study population consisted of 7,085 adults aged 22 and 30 years, belonging to the 1993 and 1982 Pelotas-RS birth cohorts, respectively. Participants' BMI values were calculated and classified according to the World Health Organization. Individuals were further categorized based on %BF values into obese (≥ 25% for men and ≥ 32% for women) and non-obese (≤ 25% for men and ≤ 32% for women). After BMI and %BF classification, individuals with BMI < 25 kg/m² were also grouped into normal weight (BMI < 25 kg/m² and absence of obesity by %BF) and normalweight obese (BMI < 25 kg/m² and presence of obesity by %BF). Input variables considered were demographic data (sex and age) and anthropometric measures (weight, height, and waist (WC), hip (HC), right wrist, and right calf circumferences). The outcome was defined as the %BF measured by Dual-Energy X-ray Absorptiometry (DXA). For model construction, the Radial Support Vector Machine (SVM) algorithm was applied, adopting Linear Regression (LR) as a baseline. Data partitioning followed the Hold-Out method, with 80% comprising the training set and 20% the test set. Exclusively in the training group, the k-fold Cross-Validation technique was applied, adopting k = 5. To evaluate performance, the following metrics were calculated: Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Normalized Root Mean Square Error (NRMSE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²). Subsequently, in the best-performing model, the SHAP function was applied to select the most relevant variables for constructing two reduced versions: Reduced 1 (containing only the five most relevant variables based on SHAP calculation – sex, WC, HC, height, and weight, in order of importance) and Reduced 2 (including the variable age). The Bland-Altman plot and Intraclass Correlation Coefficient (ICC) were applied to evaluate the agreement between the %BF estimated by the models and the %BF measured by DXA. The frequency of overweight was 26.5% and obesity 14.1%, considering BMI. However, by %BF, 52.1% of individuals were classified as obese. Notably, normal-weight obese individuals corresponded to 17.3% of the general sample. All ML models yielded better results compared to LR, with the Radial SVM - Reduced 2 achieving the best performance and agreement (MAE = 2.98; RMSE = 3.72; NRMSE = 7.07; R² = 0.91, MAPE = 13.24% and ICC = 0.95 [95% CI: 0.947 – 0.957]). It is concluded that the developed ML models, using simple demographic and anthropometric variables, presented high performance and excellent agreement for estimating %BF in adults, being potentially applicable in epidemiological research and clinical settings, including those with scarce resources, especially in Primary Health Care. It is noteworthy that the Radial SVM – Reduced 2 model proved to be accurate and parsimonious, requiring a smaller number of variables for its application. |
| publishDate |
2026 |
| dc.date.accessioned.fl_str_mv |
2026-03-03T18:46:21Z |
| dc.date.issued.fl_str_mv |
2026-02-11 |
| dc.type.driver.fl_str_mv |
Trabalho sob Sigilo. Motivo: Em processo de publicação em periódico. Prazo para Liberação:12 meses. info:eu-repo/semantics/masterThesis |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| format |
masterThesis |
| status_str |
publishedVersion |
| dc.identifier.citation.fl_str_mv |
SANTOS, Heloísa Baima da Silva. Uso de técnicas de aprendizado de máquina para estimação de gordura corporal em adultos brasileiros. 2026. 17 f. Dissertação( Programa de Pós-graduação em Saúde Coletiva/CCBS) - Universidade Federal do Maranhão, São Luís, 2026. |
| dc.identifier.uri.fl_str_mv |
https://tedebc.ufma.br/jspui/handle/tede/6815 |
| identifier_str_mv |
SANTOS, Heloísa Baima da Silva. Uso de técnicas de aprendizado de máquina para estimação de gordura corporal em adultos brasileiros. 2026. 17 f. Dissertação( Programa de Pós-graduação em Saúde Coletiva/CCBS) - Universidade Federal do Maranhão, São Luís, 2026. |
| url |
https://tedebc.ufma.br/jspui/handle/tede/6815 |
| dc.language.iso.fl_str_mv |
por |
| language |
por |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.publisher.none.fl_str_mv |
Universidade Federal do Maranhão |
| dc.publisher.program.fl_str_mv |
PROGRAMA DE PÓS-GRADUAÇÃO EM SAÚDE COLETIVA/CCBS |
| dc.publisher.initials.fl_str_mv |
UFMA |
| dc.publisher.country.fl_str_mv |
Brasil |
| dc.publisher.department.fl_str_mv |
COORDENAÇÃO DO CURSO DE NUTRIÇÃO/CCBS |
| publisher.none.fl_str_mv |
Universidade Federal do Maranhão |
| dc.source.none.fl_str_mv |
reponame:Biblioteca Digital de Teses e Dissertações da UFMA instname:Universidade Federal do Maranhão (UFMA) instacron:UFMA |
| instname_str |
Universidade Federal do Maranhão (UFMA) |
| instacron_str |
UFMA |
| institution |
UFMA |
| reponame_str |
Biblioteca Digital de Teses e Dissertações da UFMA |
| collection |
Biblioteca Digital de Teses e Dissertações da UFMA |
| bitstream.url.fl_str_mv |
http://tedebc.ufma.br:8080/bitstream/tede/6815/2/HELO%C3%8DSA+BAIMA+DA+SILVA+SANTOS.pdf http://tedebc.ufma.br:8080/bitstream/tede/6815/1/license.txt |
| bitstream.checksum.fl_str_mv |
4618895d0442156d41db7d5b76fbdf9a 97eeade1fce43278e63fe063657f8083 |
| bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 |
| repository.name.fl_str_mv |
Biblioteca Digital de Teses e Dissertações da UFMA - Universidade Federal do Maranhão (UFMA) |
| repository.mail.fl_str_mv |
repositorio@ufma.br |
| _version_ |
1863373532582379520 |