Comparison of logistics regression models in early detection of diabetes risk

  • Camilla Ramos da Silva Universidade de São Paulo – PECEGE
  • Luiz Fernando Caldeira Ribeiro UNEMAT

Abstract

Diabetes is a global public health problem affecting 463 million people, and recent studies project a 51% increase in the number of affected individuals by 2045. The rate of asymptomatic pre-diabetic individuals is also high, accounting for approximately 84% of cases, hindering timely intervention and treatment before the disease progresses to severe complications or even death. Early diagnosis of the disease proves beneficial in this scenario, and data science can contribute to achieving it. The aim of this study is to propose early prediction models for diabetes using supervised methods of Binary Logistic Regression and Multilevel Binary Logistic Regression, assessing which model yields more accurate results. This study builds upon previous research where various methodologies were applied, but none utilized a multilevel approach. Responses from a questionnaire administered to patients—both diabetic and healthy—at the Sylhet Diabetes Hospital in Bangladesh were utilized in the modeling process, containing inquiries related to symptoms commonly associated with diabetes diagnosis. This study yielded models with satisfactory results, indicating that multilevel modeling outperforms conventional Logistic Regression

Published
2024-09-28
How to Cite
Camilla Ramos da Silva, & Luiz Fernando Caldeira Ribeiro. (2024). Comparison of logistics regression models in early detection of diabetes risk. REVISTA CEREUS, 16(3), 131-150. Retrieved from http://ojs.unirg.edu.br/index.php/1/article/view/4908