Comparative Analysis of Machine Learning Models for Diabetes Prediction
Patil P1*, Lawand D2, Gambas M3, Gaikwad D4
DOI:10.5281/zenodo.15867611
1* Pratiksha Patil, Department of Mathematics, Ramsheth Thakur College of Commerce and Science, Kharghar, Maharashtra, India.
2 Deepali Lawand, Department of Mathematics, Ramsheth Thakur College of Commerce and Science, Kharghar, Maharashtra, India.
3 Mohit Gambas, Ramsheth Thakur College of Commerce and Science, Kharghar, Maharashtra, India.
4 Deepak Gaikwad, Department of Physics, KTSP Mandal’s KMC College, Khopoli, Maharashtra, India.
Diabetes is a chronic health condition affecting millions worldwide, and early detection plays a vital role in effective disease management and prevention. In this study, we conduct a comparative analysis of four machine learning models—Logistic Regression, Random Forest, Gradient Boosting, and Linear Regression—applied to the Pima Indian Diabetes dataset obtained from Kaggle. The dataset comprises diagnostic measurements of female patients aged 21 and above of Pima Indian heritage. Each model is evaluated using key classification metrics, including accuracy, precision, recall, and F1-score. Among the models, Logistic Regression and Gradient Boosting achieved the highest accuracy of 75%, while Random Forest and Linear Regression showed slightly lower performance at 72% and 73.16%, respectively. The study highlights the effectiveness of ensemble methods and traditional classifiers in predicting diabetes outcomes and provides insight into their relative strengths for clinical decision support systems. These results suggest that machine learning can be a valuable tool in aiding early diagnosis and improving patient care strategies.
Keywords: Diabetes Prediction, Machine Learning, Logistic Regression, Random Forest, Gradient Boosting, Linear Regression, Predictive Analytics.
| Corresponding Author | How to Cite this Article | To Browse |
|---|---|---|
| , Department of Mathematics, Ramsheth Thakur College of Commerce and Science, Kharghar, Maharashtra, India. Email: |
Patil P, Lawand D, Gambas M, Gaikwad D, Comparative Analysis of Machine Learning Models for Diabetes Prediction. Int J Engg Mgmt Res. 2025;15(3):89-93. Available From https://ijemr.vandanapublications.com/index.php/j/article/view/1771 |


©