AI-Powered Early Detection of Diabetes Using Machine Learning on Electronic Health Records

Authors

  • Sagar Pukale Department of Computer Science & Engineering, MIT ADT University, Pune, Maharashtra, India Author
  • Sudarshan Jagdale Department of Computer Science & Engineering, MIT ADT University, Pune, Maharashtra, India Author
  • Anushaka Bhandari Department of Computer Science & Engineering, MIT ADT University, Pune, Maharashtra, India Author
  • Sourabh Shinde Department of Computer Science & Engineering, MIT ADT University, Pune, Maharashtra, India Author

DOI:

https://doi.org/10.32628/IJSRSET25122171

Keywords:

Diabetes Prediction, Electronic Health Records (EHRs), Explainable AI (XAI), Predictive Analytics, Healthcare AI, Medical Data Mining

Abstract

Millions of people worldwide suffer from diabetes, a chronic illness that must be identified early in order to be effectively managed and complications avoided. Conventional diagnostic techniques depend on recurring clinical evaluations, which could postpone prompt action. This study investigates the use of machine learning (ML) methods for early diabetes detection in electronic health records (EHRs). To increase predictive accuracy, we offer an optimized machine learning framework that makes use of the patient's medical history, test results, and lifestyle choices. Results from experiments show that ML models perform better than traditional diagnostic techniques in terms of overall predictive performance, sensitivity, and specificity. Lastly, we go over potential future paths, such as combining explainable AI and deep learning to improve decision-making.

Downloads

Download data is not yet available.

References

Hossain, M. J., Al‐Mamun, M., & Islam, M. R. (2024). Diabetes mellitus, the fastest growing global public health concern: Early detection should be focused. Health Science Reports, 7(3), e2004.

Soomro, M. H., & Jabbar, A. (2024). Diabetes etiopathology, classification, diagnosis, and epidemiology. In BIDE's Diabetes Desk Book (pp. 19-42). Elsevier.

Bergman, M., Manco, M., Satman, I., Chan, J., Schmidt, M. I., Sesti, G., ... & Tuomilehto, J. (2024). International Diabetes Federation Position Statement on the 1-hour post-load plasma glucose for the diagnosis of intermediate hyperglycaemia and type 2 diabetes. Diabetes research and clinical practice, 209, 111589.

Al-Jawaldeh, A., Taktouk, M., Hammerich, A., Ibrahim, E. T., Nawaiseh, H., Al-Jawaldeh, H., & Faris, M. E. Nutritional Management of Diabetes: Advocacy Guide.

Forouhi, N. G., & Wareham, N. J. (2019). Epidemiology of diabetes. Medicine, 47(1), 22-27.

Kale, D. R., & Mulla, J. M. S. (2024). AI in healthcare: Enhancing patient outcomes through predictive analytics. Industrial Engineering Journal, 53(5), 73-79. Industrial Engineering Journal.

Miotto, P., Tessema, B., Tagliani, E., Chindelevitch, L., Starks, M., Emerson, C., ... & Rodwell, T. C. (2017). A standardised method for interpreting the association between mutations and phenotypic drug resistance in Mycobacterium tuberculosis. European Respiratory Journal, 50(6).

Ren, X., Wen, W., Fan, X., Hou, W., Su, B., Cai, P., ... & Zhang, Z. (2021). COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas. Cell, 184(7), 1895-1913.

Weng, J., Weng, J., Zhang, J., Li, M., Zhang, Y., & Luo, W. (2019). Deepchain: Auditable and privacy-preserving deep learning with blockchain-based incentive. IEEE Transactions on Dependable and Secure Computing, 18(5), 2438-2455.

Purnami, A. S., Mulyoto, M., & Winoto, B. (2020). Android- based shopping skill for mentally-disable student. Journal of Education and Learning (EduLearn), 14(3), 411-415.

Altamimi, A., Alarfaj, A. A., Umer, M., Alabdulqader, E. A., Alsubai, S., Kim, T. H., & Ashraf, I. (2024). An automated approach to predict diabetic patients using KNN imputation and effective data mining techniques. BMC Medical Research Methodology, 24(1), 221.

Srinivasan, V., Hariprasath, S., & Thangavel, G. (2024). Recursive feature elimination and multisupport vector machine in healthcare analytics. In Deep Learning Applications in Translational Bioinformatics (pp. 17-32). Academic Press.

Liu, J., Li, D., Shan, W., & Liu, S. (2024). A feature selection method based on multiple feature subsets extraction and result fusion for improving classification performance. Applied Soft Computing, 150, 111018.

Arunika, M., Saranya, S., Charulekha, S., Kabilarajan, S., & Kesavan, G. (2024, June). A Survey on Explainable AI Using Machine Learning Algorithms Shap and Lime. In 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT) (pp. 1-6). IEEE.

D. R. Kale, T. S. Mane, A. Buchade, P. B. Patel, L. K. Wadhwa and R. G. Pawar, "Federated Learning for Privacy-Preserving Data Mining," 2024 International Conference on Intelligent Systems and Advanced Applications (ICISAA), Pune, India, 2024, pp. 1-6, doi: 10.1109/ICISAA62385.2024.10828741.

Salman, H. A., Kalakech, A., & Steiti, A. (2024). Random forest algorithm overview. Babylonian Journal of Machine Learning, 2024, 69-79.

Dattatray Raghunath Kale. (2024). Quantum Machine Learning Algorithms for Optimization Problems: Theory, Implementation, and Applications. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 322–331. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6218

J. E. Nalavade, R. Sachdeo, D. R. Kale, A. Buchade, M. Subhedar and S. K. Shinde, "Enhancing Road Safety: An Intelligent Drowsiness Detection System Based on Deep Neural Networks," 2024 IEEE Pune Section International Conference (PuneCon), Pune, India, 2024, pp. 1-6, doi: 10.1109/PuneCon63413.2024.10895189.

Al-Selwi, S. M., Hassan, M. F., Abdulkadir, S. J., Muneer, A., Sumiea, E. H., Alqushaibi, A., & Ragab, M. G. (2024). RNN- LSTM: From applications to modeling techniques and beyond—Systematic review. Journal of King Saud University- Computer and Information Sciences, 102068.

Rajkumar, M., Lakshi, V. S., Karthik, R., & Pavithra, S. (2024, December). Optimized Deep Learning Mechanism for Intrusion Detection: Leveraging RFE-Based Feature Selection and PCA for Improved Accuracy. In 2024 International Conference on Sustainable Communication Networks and Application (ICSCNA) (pp. 1517-1522). IEEE.

Salih, A. M., Raisi‐Estabragh, Z., Galazzo, I. B., Radeva, P., Petersen, S. E., Lekadir, K., & Menegaz, G. (2025). A perspective on explainable artificial intelligence methods: SHAP and LIME. Advanced Intelligent Systems, 7(1), 2400304.

D. R. Kale, A. Buchade, J. Nalavade, S. G. Sapate and A. J. Umbarkar, "Detecting Violations of Conditional Functional Dependencies in Distributed Database," 2023 IEEE International Conference on Blockchain and Distributed Systems Security (ICBDS), New Raipur, India, 2023, pp. 1-4, doi: 10.1109/ICBDS58040.2023.10346243.

Kale, D. R., & Aparadh, S. Y. (2016). A Study of a detection and elimination of data inconsistency in data integration. International Journal of Scientific Research in Science, Engineering and Technology, 2(1), 532-535.

Kale, D. R., Jadhav, A. N., Salunkhe, S. J., Hirve, S., & Goswami, C. (2024, October). Sharding: A Scalability Solutions for Blockchain Networks. In 2024 IEEE International Conference on Blockchain and Distributed Systems Security (ICBDS) (pp. 1-8). IEEE.

Downloads

Published

12-04-2025

Issue

Section

Research Articles

How to Cite

[1]
Sagar Pukale, Sudarshan Jagdale, Anushaka Bhandari, and Sourabh Shinde, “AI-Powered Early Detection of Diabetes Using Machine Learning on Electronic Health Records”, Int J Sci Res Sci Eng Technol, vol. 12, no. 2, pp. 578–584, Apr. 2025, doi: 10.32628/IJSRSET25122171.

Similar Articles

1-10 of 237

You may also start an advanced similarity search for this article.