Comparison of Machine Learning Algorithms to Predict Cardiovascular Heart Disease Risk Level
Abstract views: 9 / PDF downloads: 3
Keywords:
Artifical Intelligence, Diagnosis, E-health, Ensemble Learning, Machine LearningAbstract
Cardiovascular diseases can pose a potential risk for almost every individual since they are
associated with multiple parameters such as chronic disease, lifestyle, especially genetic factors. For this
purpose, within the scope of the study, machine learning-based models were developed to predict the
cardiovascular disease risk level and the metric performances of the algorithms were compared. For this
purpose, the performances of the algorithms of the models developed using a data set accessible to all
researchers were analyzed in a versatile way. In the study, the results obtained using Logistic Regression,
Decision Trees, Random Forests, K-Nearest Neighbors, Gaussian Naive Bayes and LightGBM algorithms
were compared. The results present the performance of each algorithm by evaluating it on metrics such as
accuracy, precision, sensitivity and F1 score. The study aims to illuminate in which situations different
algorithms are more effective and which variables are more determinant in terms of risk estimation. The
results of this study can be used as an auxiliary diagnostic method for healthcare professionals working in
the cardiovascular field. It can also be used as a predictive model for individuals who want to use
artificial intelligence to determine the level of risk.
Downloads
References
Dahlöf, B. (2010). Cardiovascular disease risk factors: epidemiology and risk assessment. The American journal of cardiology, 105(1), 3A-9A.
de Moraes Batista, A. F., Chiavegatto Filho, A. D. P. (2019). Machine learning aplicado à Saúde. Sociedade Brasileira de Computação.
Herm, L. V., Heinrich, K., Wanner, J., Janiesch, C. (2023). Stop ordering machine learning algorithms by their explainability! A user-centered investigation of performance and explainability. International Journal of Information Management, 69, 102538.
Karthick, K., Aruna, S. K., Samikannu, R., Kuppusamy, R., Teekaraman, Y., Thelkar, A. R. (2022). Implementation of a heart disease risk prediction model using machine learning. Computational and Mathematical Methods in Medicine, 2022.
Reddy, K. V. V., Elamvazuthi, I., Aziz, A. A., Paramasivam, S., Chua, H. N., & Pranavanand, S. (2021). Heart disease risk prediction using machine learning classifiers with attribute evaluators. Applied Sciences, 11(18), 8352.
Delpino, F. M., Costa, Â. K., Farias, S. R., Chiavegatto Filho, A. D. P., Arcêncio, R. A., Nunes, B. P. (2022). Machine learning for predicting chronic diseases: a systematic review. Public Health, 205, 14-25.
Lupague, R. M. J. M., Mabborang, R. C., Bansil, A. G., Lupague, M. M. (2023). Integrated Machine Learning Model for Comprehensive Heart Disease Risk Assessment Based on Multi-Dimensional Health Factors. European Journal of Computer Science and Information Technology, 11(3), 44-58.
Ramesh, T. R., Lilhore, U. K., Poongodi, M., Simaiya, S., Kaur, A., Hamdi, M. (2022). Predictive analysis of heart diseases with machine learning approaches. Malaysian Journal of Computer Science, 132-148.
Beam, A. L., Kohane, I. S. (2018). Big data and machine learning in health care. Jama, 319(13), 1317-1318.
Panch, T., Szolovits, P., Atun, R. (2018). Artificial intelligence, machine learning and health systems. Journal of global health, 8(2).
Yadav, B. P., Ghate, S., Harshavardhan, A., Jhansi, G., Kumar, K. S., Sudarshan, E. (2020, December). Text categorization Performance examination Using Machine Learning Algorithms. In IOP Conference Series: Materials Science and Engineering (Vol. 981, No. 2, p. 022044). IOP Publishing.
Sun, H., Ramuhalli, P., Jacob, R. E. (2023). Machine learning for ultrasonic nondestructive examination of welding defects: A systematic review. Ultrasonics, 127, 106854.
Kolk, M. Z., Deb, B., Ruipérez-Campillo, S., Bhatia, N. K., Clopton, P., Wilde, A. A., ... & Tjong, F. V. (2023). Machine learning of electrophysiological signals for the prediction of ventricular arrhythmias: systematic review and examination of heterogeneity between studies. EBioMedicine, 89.
Online: https://www.kaggle.com/datasets/alphiree/cardiovascular-diseases-risk-prediction-dataset
Khanna, D., Peltzer, C., Kahar, P., & Parmar, M. S. (2022). Body mass index (BMI): a screening tool analysis. Cureus, 14(2).
Liang, W., Luo, S., Zhao, G., & Wu, H. (2020). Predicting hard rock pillar stability using GBDT, XGBoost, and LightGBM algorithms. Mathematics, 8(5), 765.
Deng, S., Wei, M., Xu, M., & Cai, W. (2021). Prediction of the rate of penetration using logistic regression algorithm of machine learning model. Arabian Journal of Geosciences, 14, 1-13.
Pan, Z., Wang, Y., & Pan, Y. (2020). A new locally adaptive k-nearest neighbor algorithm based on discrimination class. Knowledge-Based Systems, 204, 106185.
Chen, S., Webb, G. I., Liu, L., & Ma, X. (2020). A novel selective naïve Bayes algorithm. Knowledge-Based Systems, 192, 105361.
Tangirala, S. (2020). Evaluating the impact of GINI index and information gain on classification using decision tree classifier algorithm. International Journal of Advanced Computer Science and Applications, 11(2), 612-619.
Schonlau, M., Zou, R. Y. (2020). The random forest algorithm for statistical learning. The Stata Journal, 20(1), 3-29.