Comparative Analysis of Machine Learning Classification Models in Predicting Cardiovascular Disease


Abstract views: 7 / PDF downloads: 4

Authors

  • Ladislav Végh J. Selye University
  • Ondrej Takáč J. Selye University
  • Krisztina Czakóová J. Selye University
  • Daniel Dancsa J. Selye University
  • Melinda Nagy J. Selye University

Keywords:

Data Analysis, Machine Learning, Classification Models, Heart Disease, Cardiovascular Disease

Abstract

For a long time, cardiovascular diseases have been the leading cause of death worldwide.
Machine learning has found significant usage in the medical field as it can find patterns in data.
Classification models can help cardiologists to diagnose heart diseases and minimize misdiagnosis
accurately. In this paper, we explored a dataset related to heart disease and compared the accuracy of 43
machine learning classification models. The dataset for this research was downloaded from Kaggle; it
contained 1190 observations, 11 features (age, sex, chest pain type, resting blood pressure, serum
cholesterol, fasting blood sugar, resting electrocardiogram results, maximum heart rate achieved, exercise
induced angina, oldpeak, the slope of the peak exercise ST segment) and a binary target variable (no heart
disease or observed cardiovascular disease). For data exploration, preprocessing, training, testing, and
predictor importance analysis, we used MATLAB R2004a software and the Classification Learner app
included in this software. Before training machine learning classification models, we divided the dataset
into a training set (90% of observations) and a test set (10% of observations). To prevent overfitting
during the training of classification models, 10-fold cross-validation was used. The result showed that the
best accuracy was reached with an optimized ensemble classification model (validation accuracy: 0.9262
and test accuracy: 0.9580). After calculating the permutation importance of each feature, we observed that
the most important feature among all 11 features was the slope of the peak exercise ST segment.

Downloads

Download data is not yet available.

Author Biographies

Ladislav Végh, J. Selye University

Department of Informatics, Faculty of Economics and Informatics, Slovakia

Ondrej Takáč, J. Selye University

Department of Informatics, Faculty of Economics and Informatics, Slovakia

Krisztina Czakóová, J. Selye University

Department of Informatics, Faculty of Economics and Informatics, Slovakia

Daniel Dancsa, J. Selye University

Department of Biology, Faculty of Education, Slovakia

Melinda Nagy, J. Selye University

Department of Biology, Faculty of Education, Slovakia

References

World Heart Federation (2023) World Heart Report 2023. Confronting The World’s Number One Killer. [Online]. Available: https://world-heart-federation.org/wp-content/uploads/World-Heart-Report-2023.pdf

(2022) Cardiovascular Disease. [Online]. Available: https://my.clevelandclinic.org/health/diseases/21493-cardiovascular-disease

L. Végh, K. Czakóová, and O. Takáč, “Comparing Machine Learning Classification Models on a Loan Approval Prediction Dataset,” International Journal of Advanced Natural Sciences and Engineering Researches, vol. 7, no. 9, 2023, pp. 98–103. https://doi.org/10.59287/ijanser.1516

M. T. Fülöp, M. Gubán, Á. Gubán, and M. Avornicului, “Application Research of Soft Computing Based on Machine Learning Production Scheduling,” Processes, vol. 10. no. 3, 2022, paper 520. https://doi.org/10.3390/pr10030520

J. Udvaros and N. Forman, “The Merger of Machine Learning and Artificial Intelligence: New Horizons in Education 4.0,” in ICEBM 2023 6th International Conference on Economics and Business Management, Cluj-Napoca, Romania, 2023, p. 48.

C. M. Bhatt, P. Patel, T. Ghetia, and P. L. Mazzeo, “Effective Heart Disease Prediction Using Machine Learning Techniques,” Algorithms, vol. 16, issue 2, 2023, paper 88. https://doi.org/10.3390/a16020088

S. Subramani, N. Varshney, M. V. Anand, M. E. M. Soudagar, L. A. Al-keridis, T. K. Upadhyay, N. Alshammari, M. Saeed, K. Subramanian, K. Anbarasu, and K. Rohini, “Cardiovascular diseases prediction by machine learning incorporation with deep learning,” Frontiers in Medicine, vol. 10, 2023, paper 1150933. https://doi.org/10.3389/fmed.2023.1150933

M. Siddhartha. (2024) Heart Disease Dataset. [Online]. Available: https://www.kaggle.com/datasets/mexwell/heart-disease-dataset/data

M. Siddhartha. (2020) Heart Disease Dataset (Comprehensive). [Online] IEEE Dataport. Available: https://dx.doi.org/10.21227/dz4t-cm36

(2024). MATLAB. [Online]. Available: https://www.mathworks.com/products/matlab.html

(2024). Classification Learner. [Online] Available: https://www.mathworks.com/help/stats/classificationlearner-app.html

A. Ogunpola, F. Saeed, S. Basurra, A. M. Albarrak, and S. N. Qasem, “Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases,” Diagnostics, vol. 14, no. 2, 2024, paper 144. https://doi.org/10.3390/diagnostics14020144

A. Garg, B. Sharma, and R. Khan, "Heart disease prediction using machine learning techniques," in IOP Conference Series: Materials Science and Engineering, vol. 1022, 2021, paper 012046. https://doi.org/10.1088/1757-899X/1022/1/012046

B. Akkaya, E. Sener, and C. Gursu, "A Comparative Study of Heart Disease Prediction Using Machine Learning Techniques," in 2022 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), Ankara, Turkey, 2022, pp. 1–8. https://doi.org/10.1109/HORA55278.2022.9799978

(2024) Warning signs and symptoms of heart disease. [Online] Available: https://www.mountsinai.org/health-library/selfcare-instructions/warning-signs-and-symptoms-of-heart-disease

(2024) Heart disease. [Online] Available: https://www.mayoclinic.org/diseases-conditions/heart-disease/symptoms-causes/syc-20353118

Downloads

Published

2024-07-25

How to Cite

Végh, L., Takáč, O., Czakóová, K., Dancsa, D., & Nagy, M. (2024). Comparative Analysis of Machine Learning Classification Models in Predicting Cardiovascular Disease. International Journal of Advanced Natural Sciences and Engineering Researches, 8(6), 23–31. Retrieved from https://as-proceeding.com/index.php/ijanser/article/view/1923

Conference Proceedings Volume

Section

Articles