Choosing appropriate imputation methods for missing data : a decision algorithm on methods for missing data

Other Title(s)

اختيار طرق احتساب مناسبة للبيانات المفقودة بتحديد طرق لوغارتمية للبيانات

Joint Authors

Mahmud, Wisam Ali
Rashid, Muhammad Sabah
Khayri, Tibah Wala al-Din

Source

al-Qadisiyah Journal for Computer Science and Mathematics

Issue

Vol. 11, Issue 2 (31 Dec. 2019), pp.65-73, 9 p.

Publisher

University of al-Qadisiyah College of computer Science and Information Technology

Publication Date

2019-12-31

Country of Publication

Iraq

No. of Pages

9

Main Topic

Information Technology and Computer Science

Abstract EN

Missing values commonly happen in the realm of medical research, which is regarded creating a lot of bias in case it is neglected with poor handling. However, while dealing with such challenges, some standard statistical methods have been already developed and available, yet no credible method is available so far to infer credible estimates. The existing data size gets lowered, apart from a decrease in efficiency happens when missing values is found in a dataset. A number of imputation methods have addressed such challenges in early scholarly works for handling missing values. Some of the regular methods include complete case method, mean imputation method, Last Observation Carried Forward (LOCF) method, Expectation-Maximization (EM) algorithm, and Markov Chain Monte Carlo (MCMC), Mean Imputation (Mean), Hot Deck (HOT), Regression Imputation (Regress), K-nearest neighbor (KNN),K-Mean Clustering, Fuzzy K-Mean Clustering, Support Vector Machine, and Multiple Imputation (MI) method. In the present paper, a simulation study is attempted for carrying out an investigative exploration into the efficacy of the above mentioned archetypal imputation methods along with longitudinal data setting under missing completely at random (MCAR). We took out missingness from three cases in a block having low missingness of 5% as well as higher levels at 30% and 50%. With this simulation study, we concluded LOCF method having more bias than the other methods in most of the situations after carrying out a comparison through simulation study.

American Psychological Association (APA)

Mahmud, Wisam Ali& Rashid, Muhammad Sabah& Khayri, Tibah Wala al-Din. 2019. Choosing appropriate imputation methods for missing data : a decision algorithm on methods for missing data. al-Qadisiyah Journal for Computer Science and Mathematics،Vol. 11, no. 2, pp.65-73.
https://search.emarefa.net/detail/BIM-883450

Modern Language Association (MLA)

Mahmud, Wisam Ali…[et al.]. Choosing appropriate imputation methods for missing data : a decision algorithm on methods for missing data. al-Qadisiyah Journal for Computer Science and Mathematics Vol. 11, no. 2 (2019), pp.65-73.
https://search.emarefa.net/detail/BIM-883450

American Medical Association (AMA)

Mahmud, Wisam Ali& Rashid, Muhammad Sabah& Khayri, Tibah Wala al-Din. Choosing appropriate imputation methods for missing data : a decision algorithm on methods for missing data. al-Qadisiyah Journal for Computer Science and Mathematics. 2019. Vol. 11, no. 2, pp.65-73.
https://search.emarefa.net/detail/BIM-883450

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 71-72

Record ID

BIM-883450