A hybrid model with feature selection and hyper parameters for detecting diabetes in PIMA Indian dataset
DOI: https://doi.org/10.59287/icias.1567
Keywords: Machine Learning, Diabetes Prediction, Artificial Intelligence, Hybrid Model
Abstract
Diabetes is a prevalent global health concern, and timely detection plays a crucial role in its treatment and prevention. Artificial Intelligence (AI) and Machine Learning (ML) algorithms have gained prominence for their ability to analyze large datasets in support of disease diagnosis and treatment. This study focuses on developing accurate models for the early diagnosis of diabetes. We evaluated several ML algorithms, namely K-Nearest Neighbor (KNN), Support Vector Machine (SVM), Logistic Regression (LR), Extra Trees (ET), AdaBoost (AB), and Gradient Boosting (GB), in combination with different preprocessing techniques, hyperparameter tuning, XGBoost-based feature selection, and crossover strategies. We also tested a hybrid model under several validation scenarios to assess its effectiveness. Logistic Regression achieved the highest classification accuracy, at 77%. This result highlights the potential of ML techniques, particularly Logistic Regression, for early diabetes diagnosis.
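For illustration only, the kind of workflow the abstract outlines (preprocessing, model-based feature selection, hyperparameter tuning, and cross-validated scoring of a Logistic Regression classifier) might be sketched as follows. This is not the authors' code, and several choices are assumptions. scikit-learn's `GradientBoostingClassifier` stands in for XGBoost as the feature-importance model. The data are synthetic, shaped like the PIMA dataset (768 rows, 8 features). The grid of `C` values, the median importance threshold, and the 5-fold split are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.feature_selection import SelectFromModel
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Assumption: synthetic stand-in data with the PIMA dataset's shape
# (768 samples, 8 features). Swap in the real data to get meaningful numbers.
X, y = make_classification(
    n_samples=768, n_features=8, n_informative=5, n_redundant=1, random_state=0
)

# Assumption: gradient boosting from scikit-learn replaces XGBoost here;
# features with importance at or above the median are kept.
selector = SelectFromModel(
    GradientBoostingClassifier(n_estimators=50, random_state=0),
    threshold="median",
)

pipeline = Pipeline([
    ("scale", StandardScaler()),                 # preprocessing step
    ("select", selector),                        # model-based feature selection
    ("clf", LogisticRegression(max_iter=1000)),  # final classifier
])

# Hypothetical hyperparameter grid for the regularization strength C.
param_grid = {"clf__C": [0.01, 0.1, 1.0, 10.0]}

# Stratified 5-fold cross-validation keeps the class ratio in every fold.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
search = GridSearchCV(pipeline, param_grid, cv=cv, scoring="accuracy")
search.fit(X, y)

best_params = search.best_params_
best_accuracy = search.best_score_
n_selected = int(search.best_estimator_.named_steps["select"].get_support().sum())

print("Best parameters:", best_params)
print("Mean cross-validated accuracy:", round(best_accuracy, 3))
print("Features kept:", n_selected, "of", X.shape[1])
```

Placing the scaler and the selector inside the pipeline means both are refit on each training fold, so the held-out fold never influences feature selection or scaling. The accuracy printed here reflects the synthetic data and says nothing about the 77% reported in the study.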
License
Copyright (c) 2023 International Conference on Innovative Academic Studies
This work is licensed under a Creative Commons Attribution 4.0 International License.