An Explainable Artificial Intelligence Based Early Lung Cancer Risk Prediction Using LightGBM


Abstract views: 5 / PDF downloads: 3

Authors

  • Ömer Miraç Kökçam Kökçam Fırat University
  • Yunus Santur Fırat University
  • Muhammed Emre Çolak Fırat University

Keywords:

Cancer Risk Prediction, Machine Learning, Data Engineering, Explainable Artificial Intelligence

Abstract

Lung cancer is the most common cancer in the world and is also the deadliest. Early diagnosis
can improve patients’ life expectancy and reduce the cost of treatment. The aim of this study is to help
physicians and patients by using explainable artificial intelligence methods to early diagnose risk of lung
cancer. This study will create an opportunity for physicians to make early diagnosis and treatment
strategies for patients. In this study, a machine learning model was developed to predict the risk of lung
cancer by using the LightGBM algorithm. Furthermore, the SHAP method is used to explain why and
how the model's predictions are made, thus making the AI model reliable. These explanations increase the
acceptability and reliability of the predictions made by model, while helping physicians and patients
understand the model's decisions. The results obtained show that the developed LightGBM model can
predict the risk of lung cancer with a 100% accuracy rate. The model has achieved great results in terms
of accuracy rate and sensitivity. In addition, the SHAP analyses explain which features each of the
model's predictions is based on, which will increase the confidence of physicians and patients in the
decisions of artificial intelligence.

Downloads

Download data is not yet available.

Author Biographies

Ömer Miraç Kökçam Kökçam, Fırat University

Department of Software Engineering, Türkiye

Yunus Santur, Fırat University

Department of Artificial Intelligence and Data Engineering, Türkiye

Muhammed Emre Çolak, Fırat University

Department of Artificial Intelligence and Data Engineering, Türkiye

References

Y. Xue, L. Wang, Y. Zhang, Y. Zhao, ve Y. Liu, “Air pollution: A culprit of lung cancer”, J. Hazard. Mater., c. 434, s. 128937, Tem. 2022, doi: 10.1016/j.jhazmat.2022.128937.

I. A. J. D. Kusumawardani, P. Indraswari, ve N. L. G. Y. Komalasari, “Air Pollution and Lung Cancer”, J. Respirasi, c. 9, ss. 150-158, May. 2023, doi: 10.20473/jr.v9-I.2.2023.150-158.

Nooreldeen, R.; Bach, H. Current and Future Development in Lung Cancer Diagnosis. Int. J. Mol. Sci. 2021, 22, 8661. https://doi.org/10.3390/ijms22168661

International Agency for Research on Cancer. 2023. Available online: https://gco.iarc.fr/today (accessed on 11 October 2023).

N. A. Salman and S. T. Hasson, "A Prediction Approach for Small Healthcare Dataset," 2023 8th International Conference on Smart and Sustainable Technologies (SpliTech), Split/Bol, Croatia, 2023, pp. 1-5, doi: 10.23919/SpliTech58164.2023.10193552.K. Elissa, “Title of paper if known,” unpublished.

V. Cetin, H. H. Yumrukaya and C Bakir, “Detection of Lung Cancer with Enhanced Feed Forward Backpropagation Artificial Neural Networks”, Proceedings of the International Conference on Industrial Engineering and Operations Management Manila, Philippines, March 7-9, 2023,

Nasser, Ibrahim, “Lung Cancer Detection Using Artificial Neural Network” (2019). International Journal of Engineering and Information Systems (IJEAIS), Vol. 3 Issue 3, March – 2019, Pages: 17-23 , Available at SSRN: https://ssrn.com/abstract=3700556

Abdullah, Dakhaz & Mohsin Abdulazeez, Adnan & Sallow, Amira. (2021). Lung cancer Prediction and Classification based on Correlation Selection method Using Machine Learning Techniques. Qubahan Academic Journal. 1. 141-149. 10.48161/qaj.v1n2a58.

R. Patra, “Prediction of Lung Cancer Using Machine Learning Classifier”, Computing Science, Communication and Security, N. Chaubey, S. Parikh, ve K. Amin, Ed., Singapore: Springer Singapore, 2020, ss. 132-142.

Dritsas, E.; Trigka, M. Lung Cancer Risk Prediction with Machine Learning Models. Big Data Cogn. Comput. 2022, 6, 139. https://doi.org/10.3390/bdcc6040139

W. Purba, S. Wardani, D. F. . Lumbantoruan, F. C. I. Silalahi, and T. L. Edison, “OPTIMIZATION OF LUNG CANCER CLASSIFICATION METHOD USING EDA-BASED MACHINE LEARNING”, JUSIKOM PRIMA, vol. 6, no. 2, pp. 43-50, Feb. 2023.

A. Gupta, Z. Zuha, I. Ahmad, ve Z. Ansari, “A Study On Prediction Of Lung Cancer Using Machine Learning Algorithms”, In Review, preprint, Agu. 2022. doi: 10.21203/rs.3.rs-1912967/v1.

Hamed, E.A.-R.; Salem, M.A.-M.; Badr, N.L.; Tolba, M.F. An Efficient Combination of Convolutional Neural Network and LightGBM Algorithm for Lung Cancer Histopathology Classification. Diagnostics 2023, 13, 2469. https://doi.org/10.3390/diagnostics13152469

A. G V S, A. Kethaan, ve G. V. N. Akshay Varma, “Fraud Detection System Employing Machine Learning Techniques for Credit Card Transactions”, International Journal of Advanced Research in Science Communication and Technology, c. 3, ss. 2581-9429, Agu. 2023, doi: 10.48175/IJARSCT-12492.

A. Shabbir, M. Shabbir, A. R. Javed, M. Rizwan, C. Iwendi, ve C. Chakraborty, “Exploratory data analysis, classification, comparative analysis, case severity detection, and internet of things in COVID-19 telemonitoring for smart hospitals”, Journal of Experimental & Theoretical Artificial Intelligence, c. 35, sy 4, ss. 507-534, May. 2023, doi: 10.1080/0952813X.2021.1960634.

Air Pollution, Alcohol, Smoking & Risk of Lung Cancer Dataset. https://www.kaggle.com/datasets/thedevastator/cancer-patients-and-air-pollution-a-new-link Access Date : 09 Sep 2023

Kuzu A, Santur Y. Early Diagnosis and Classification of Fetal Health Status from a Fetal Cardiotocography Dataset Using Ensemble Learning. Diagnostics. 2023; 13(15):2471. https://doi.org/10.3390/diagnostics13152471

Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A Highly Efficient Gradient Boosting Decision Tree. Adv. Neural Inf. Process Syst. 2017, 30, 3149–3157.

S. Lundberg and S.-I. Lee, ‘A Unified Approach to Interpreting Model Predictions’, ArXiv170507874 Cs Stat, Nov. 2017, http://arxiv.org/abs/1705.07874

Downloads

Published

2024-10-13

How to Cite

Kökçam, Ömer M. K., Santur, Y., & Çolak, M. E. (2024). An Explainable Artificial Intelligence Based Early Lung Cancer Risk Prediction Using LightGBM. International Journal of Advanced Natural Sciences and Engineering Researches, 7(10), 50–57. Retrieved from https://as-proceeding.com/index.php/ijanser/article/view/2061

Issue

Section

Articles