A hybrid model with feature selection and hyper parameters for detecting diabetes in PIMA Indian dataset


Abstract views: 22 / PDF downloads: 27

Authors

  • Cihan Açıkyürek Department of Electrical and Electronics Engineering/Faculty of Engineering and Architecture, Yozgat Bozok University, Yozgat, Türkiye
  • Gökalp Çınarer Departmentof Computer Engineering/Faculty of Engineering and Architecture, Yozgat Bozok University, Yozgat, Türkiye

DOI:

https://doi.org/10.59287/icias.1567

Keywords:

Machine Learning, Diabetes Prediction, Artificial Intelligence, Hybrid Model

Abstract

Diabetes is a prevalent global health concern, with the timely detection of the disease playing a crucial role in treatment and prevention. Artificial Intelligence (AI) and Machine Learning (ML) algorithms have gained prominence due to their ability to analyze large datasets, aiding in disease diagnosis and treatment. This study focuses on developing accurate models for the early diagnosis of diabetes. We explored the performance of various ML algorithms, including K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Logistic Regression (LR), Extra Trees (ET), AdaBoost (AB), and Gradient Boosting (GB) while also employing different preprocessing techniques, hyperparameter tuning, XGBoost feature selection and crossover strategies. Furthermore, we tested a hybrid model using validation scenarios to assess its effectiveness. The study's outcomes revealed that the Logistic Regression algorithm achieved the highest classification accuracy, reaching 77%. This result highlights the potential of ML techniques, particularly Logistic Regression, in early diabetes diagnosis.

Downloads

Download data is not yet available.

Downloads

Published

2023-10-06

How to Cite

Açıkyürek, C., & Çınarer, G. (2023). A hybrid model with feature selection and hyper parameters for detecting diabetes in PIMA Indian dataset . International Conference on Innovative Academic Studies, 3(1), 427–435. https://doi.org/10.59287/icias.1567