Diabetes Diagnosis through Machine Learning: An Analysis of Classification Algorithms

  • Haris Ahmed, Karachi Institute of Economics and Technology, Karachi
  • Muhammad Affan Alim
  • Waleej Haider
  • Muhammad Nadeem
  • Ahsan Masroor
  • Nadeem Qamar
Keywords: Logistic Regression, Naive Bayes, Decision Tree, Machine Learning, Diabetes


Diabetes is a serious chronic disease characterized by high blood sugar levels. If left untreated, it can lead to
numerous complications. Traditionally, diagnosing diabetes has required a visit to a diagnostic center and
consultation with a doctor. However, machine learning can help identify the disease earlier and more accurately.
This study aimed to build a model that predicts the likelihood of diabetes in patients using three machine
learning classification algorithms: Logistic Regression (LR), Decision Tree (DT), and Naive Bayes (NB). The
models were evaluated on the Pima Indians Diabetes Database (PIDD) from the UCI Machine Learning Repository,
and their performance was compared using accuracy, precision, recall, and F-measure. Logistic Regression
achieved the highest accuracy, 71.39%, outperforming the other two algorithms.
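
The four evaluation metrics named in the abstract can be illustrated with a minimal sketch. It assumes binary labels where 1 marks a diabetic patient and 0 a non-diabetic one. The function name `binary_metrics` and the toy label lists are hypothetical, introduced only for illustration. They are not taken from PIDD or from the paper's results, and this is not necessarily how the authors computed their scores.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall, and F-measure for 0/1 labels.

    Assumption: label 1 is the positive class (diabetic), label 0 the negative class.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    accuracy = (tp + tn) / len(y_true)
    # Fall back to 0.0 when a denominator is zero (no positive predictions or labels).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Hypothetical toy labels for illustration only (not PIDD data).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
toy_scores = binary_metrics(y_true, y_pred)
print(toy_scores)
```

Applying the same function to the predictions of each classifier on a held-out test split would give a like-for-like comparison across the three algorithms. Recall is often the metric to watch in a screening setting, since a false negative corresponds to a missed diagnosis.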

How to Cite
Ahmed, H., Affan Alim, M., Haider, W., Nadeem, M., Masroor, A., & Qamar, N. (2023). Diabetes Diagnosis through Machine Learning: An Analysis of Classification Algorithms. Lahore Garrison University Research Journal of Computer Science and Information Technology, 7(1), 29-34. https://doi.org/10.54692/lgurjcsit.2023.0701411