Addressing Missing Data in Surveys and Implementing Imputation Methods with SPSS
Abstract views: 43 / PDF downloads: 30
Keywords:
Missing Data, Imputation, Survey, SPSS, Deterministic, ProbabilisticAbstract
The presence of missing data in surveys or in other types of scientific research poses significant
challenges for academic research, impacting the reliability, validity, and generalizability of study findings.
Missing data can introduce bias into the analysis, leading to erroneous results and conclusions. Furthermore,
missing data can compromise the statistical power of analyses, reducing the precision and accuracy of
estimates. Consequently, researchers may encounter difficulties in drawing robust conclusions or
identifying significant patterns within the data. Missing data can stem from various sources, including
participant non-response, data entry errors, survey design flaws, and respondent unwillingness to disclose
sensitive information. Many researchers struggle with how to handle missing data in their studies. They
often use simple methods like deleting all the cases with missing data, partial deletion, or filling in missing
values with a single number, such as the values of the variables mean, median, or mode. However, these
methods can be misleading because they don't take into account the reasons why data might be missing and
produce estimation errors, so other approaches are implemented to impute the missing values. However,
each approach has its limitations and assumptions, which can influence the validity of results and introduce
additional uncertainty into the analysis. This article analyzes the problem of missing data in social surveys,
the reasons for missing data, the types of missing data, and also suggests several ways, deterministic and
probabilistic, for data imputation.
Downloads
References
Schneider, C. R. (2022). Missing data in surveys: Key concepts, approaches, and applications. Research in Social and Administrative Pharmacy, 18(2), 2308-2316.
Cook, R. M. (2021). Addressing missing data in quantitative counseling research. Counseling Outcome Research and Evaluation, 12(1), 43-53.
Firat, M., Dikbas, F., Koç, A. C., & Gungor, M. (2010). Missing data analysis and homogeneity test for Turkish precipitation series. Sadhana, 35(6), 707-720.
Halidini, D., Xhafaj, E., & Gjikaj, N. Treating the missing values for the total waste recycling in Albania. Interdisiplinary Journal of Research and Development, Vol. 4, no. 1. 2017
Acock, A. C. (2005). Working with missing values. Journal of Marriage and family, 67(4), 1012-1028.
Kosova, R., Xhafaj, E., Karriqi, A., Boci, B., & Guxholli, D. (2022). Missing Data In The Oil Industry And Methods Of Imputations Using Spss: The Impact On Reserve Estimation. Journal of Multidisciplinary Engineering Science and Technology, 9(2), 15146-15155.
Lin, W. C., & Tsai, C. F. (2020). Missing value imputation: a review and analysis of the literature (2006–2017). Artificial Intelligence Review, 53, 1487-1509.
Anaby-Tavor, A., Carmeli, B., Goldbraich, E., Kantor, A., Kour, G., Shlomov, S., ... & Zwerdling, N. (2020, April). Do not have enough data? Deep learning to the rescue!. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 34, No. 05, pp. 7383-7390).
Scarelli, A., Kosova, R., & Sinaj, V. (2018). Multicriteria decision aid for measuring resilience in a three-dimensional space. DOI 10.33107/ubt-ic.2018.97
Kosova, R., Shehu, V., Naco, A., Xhafaj, E., Stana, A., & Ymeri, A. (2015). Monte Carlo simulation for estimating geologic oil reserves. A case study from Kucova Oilfield in Albania. Muzeul Olteniei Craiova. Oltenia. Studii şi comunicări. Ştiinţele Naturii, 31(2), 20-25. ISSN 1454-6914.
Kosova, R., Naco, A., & Prifti, I. (2016). Deterministic and Stochastic methods of oil field reserves estimation: A case study from KA. Oil field. Interdisplinary Journal of Research and Development, 4(2), 226-231.
Kosova, R., Subashi, D. X., Halidini, D. Q., & Kullolli, T. (2023). MCDA for evaluating the competitiveness of tourist destinations in Albania. International Journal of Advanced Natural Sciences and Engineering Researches (IJANSER), 7(11), 95–102. https://doi.org/10.59287/as-ijanser.569
Kosova, R., Halidini, D. Q., Xhafaj, E., Gjikaj, N., & Kosova, A. M. (2023). Assessing tourism service quality in Albania: A 5-scale Likert survey analysis and interpretation with SPSS. http://dx.doi.org/10.59287/as-proceedings.367
Bici, R. (2023). Simple methods to handle missing data. International Journal of Computational Economics and Econometrics, 13(2), 216-242.
Kosova, R., Qendraj, D. H., & Xhafaj, E. (2022). Meta-Analysis ELECTRE III and AHP in Evaluating and Ranking the Urban Resilience. Journal of Environmental Management & Tourism, 13(3), 756-768. DOI http://dx.doi.org/10.14505/jemt.v13.3(59).15
Gjana, A., & Kosova, R. Traditional Class, and Online Class Teaching. Comparing the Students Performance Using ANCOVA. Journal of Multidisciplinary Engineering Science and Technology (JMEST), 14806-14811.
Kosova, R., & Sinaj, V. (2021). Mathematical modeling of tourism development. An application to albanian tourism. Journal of Environmental Management & Tourism, 12(6 (54)), 1707-1715. http://dx.doi.org/10.14505//jemt.v12.6(54).26
Kosova, R., & Prifti, I. (2021). Missing data in the oil industry. Method of imputations and the impact on reserve estimation. In Conference on Applied and Industrial Mathematics (pp. 28-29).
Kosova, R., & Sinaj, V. (2020). SERVICE QUALITY AND HOTEL CUSTOMER SATISFACTION: A CASE STUDY FROM DURRES, ALBANIA. Annals Series, of' Constantin Brancusi'University of Targu-Jiu. Economy (6).
Kosova, A. G. R. The Performance Of University Students And High School Factors. Statistical Analyses And ANCOVA.
Kosova, R., Sinaj, V., Scarelli, A., Kosova, A. M., & Stana, A. (2017). Academic staff performance evaluation, using decision analyses method and application. http://dx.doi.org/10.33107/ubt-ic.2017.124
Bruni, R., Daraio, C., & Aureli, D. (2021). Imputation techniques for the reconstruction of missing interconnected data from higher educational institutions. Knowledge-Based Systems, 212, 106512.
Zdebska, W. (2021). The occurrence of missing data in surveys. Acta Scientiarum Polonorum. Oeconomia, 20(2), 95-103.
KAYA, M., & Saleem, A. M. (2023). Predictive Analysis in E-commerce: Utilizing Data Mining Techniques to Forecast Customer Purchasing Behavior. International Journal of Advanced Natural Sciences and Engineering Researches (IJANSER), 7(11), 194–200. https://doi.org/10.59287/as-ijanser.583
Hajrulla, S., Hajrulla, D., Lino, V., & Stojani, T. (2023). Statistical Data on Risk Management in University. Overview of Numerical Results. International Journal of Advanced Natural Sciences and Engineering Researches (IJANSER), 7(11), 13–19. https://doi.org/10.59287/as-ijanser.558
Demir, T., Hajrulla, S., Bezati, L., & Hajrulla, D. (2023). Application of Numerical Methods in Design of Industrial Structures. International Journal of Advanced Natural Sciences and Engineering Researches (IJANSER), 7(11), 295–300. https://doi.org/10.59287/as-ijanser.632
Akif, A. V. C. U. (2021). Effects of Different Multiple Imptutation Techniques on the Model Fit of Confirmatory Factor Analysis. Trakya Eğitim Dergisi, 11(3), 1227-1238.
Fisher, R. A. (1925, July). Theory of statistical estimation. In Mathematical proceedings of the Cambridge philosophical society (Vol. 22, No. 5, pp. 700-725). Cambridge University Press.
Little, R. J., & Rubin, D. B. (2019). Statistical analysis with missing data (Vol. 793). John Wiley & Sons.
Little, R. J. (2021). Missing data assumptions. Annual Review of Statistics and Its Application, 8, 89-107.
Grund, S., Lüdtke, O., & Robitzsch, A. (2021). On the treatment of missing data in background questionnaires in educational large-scale assessments: An evaluation of different procedures. Journal of Educational and Behavioral Statistics, 46(4), 430-465.
Nakagawa, S., & Freckleton, R. P. (2008). Missing in action: the dangers of ignoring missing data. Trends in ecology & evolution, 23(11), 592-596.
Howell, D. C. (2007). The treatment of missing data.
Tasho, E. M., & Kafazi, D. Using multiple imputation methods to analyze missing values for real and simulated data. In Book of proceedings (p. 11).
Barnum, M. (2022). Dealing with missing and incomplete data. In Handbook of Research Methods in International Relations (pp. 425-445). Edward Elgar Publishing.
Alruhaymi, A. Z., & Kim, C. J. (2021). Study on the Missing Data Mechanisms and Imputation Methods. Open Journal of Statistics, 11(4), 477-492.
Jebb, A. T., Ng, V., & Tay, L. (2021). A review of key Likert scale development advances: 1995–2019. Frontiers in psychology, 12, 637547.