Default prediction model : the significant role of data engineering in the quality of outcomes

Co-authors

al-Nuaymat, Ghazi
al-Qarm, Ahmad
al-Hasan, Mays
al-Dibi, Mutazz Muhammad

Source

The International Arab Journal of Information Technology

Issue

Vol. 17, no. 4A (s) (31 July 2020), pp. 635-644, 10 p.

Publisher

Zarqa University, Deanship of Scientific Research

Publication date

2020-07-31

Country of publication

Jordan

Number of pages

10

Main subjects

Economics, Finance, and Business Administration

Abstract (EN)

For financial institutions and the banking industry, it is crucial to have predictive models for their core financial activities, especially those that play major roles in risk management.

Predicting loan default is one of the critical issues that banks and financial institutions focus on, as huge revenue losses could be prevented by predicting a customer’s ability not only to pay back a loan, but also to do so on time.

Customer loan default prediction is the task of proactively identifying customers who are most likely to stop paying back their loans.

This is usually done by dynamically analyzing customers’ relevant information and behaviors.

This matters because it allows the bank or financial institution to estimate the borrowers’ risk.

Many different machine learning classification models and algorithms have been used to predict customers’ ability to pay back loans.

In this paper, three classification methods (Naïve Bayes, Decision Tree, and Random Forest) are used for prediction. A comprehensive set of pre-processing techniques is applied to the dataset to improve data quality by fixing some of the main data issues, such as missing values and imbalanced data. Three feature extraction algorithms are also used to enhance accuracy and performance.
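
The abstract does not say which techniques were used for missing values or class imbalance. As a rough illustration only, the sketch below assumes mean imputation and random oversampling of the minority class, two common choices. The record layout, the `income` and `default` field names, and both helper functions are hypothetical and are not taken from the paper.

```python
import random
from statistics import mean


def impute_mean(rows, column):
    """Replace None in `column` with the mean of the observed values."""
    observed = [r[column] for r in rows if r[column] is not None]
    fill = mean(observed)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]


def oversample_minority(rows, label, seed=0):
    """Randomly duplicate rows of smaller classes until all classes match the largest."""
    rng = random.Random(seed)
    by_class = {}
    for r in rows:
        by_class.setdefault(r[label], []).append(r)
    target = max(len(group) for group in by_class.values())
    balanced = []
    for group in by_class.values():
        balanced.extend(group)
        # Draw the missing rows with replacement from the same class.
        balanced.extend(rng.choices(group, k=target - len(group)))
    return balanced


# Hypothetical toy records: default = 1 means the loan defaulted.
loans = [
    {"income": 40.0, "default": 0},
    {"income": None, "default": 0},
    {"income": 60.0, "default": 0},
    {"income": 55.0, "default": 0},
    {"income": 20.0, "default": 1},
]

clean = impute_mean(loans, "income")
balanced = oversample_minority(clean, "default")
```

In a real experiment, oversampling would normally be applied only to the training split, so that duplicated rows do not leak into the evaluation data.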

The results of the competing models varied after data preprocessing techniques and feature selection were applied.

The results were compared using the F1 measure.
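
For readers unfamiliar with the measure, F1 is the harmonic mean of precision and recall. The sketch below computes it for a binary task, assuming that label `1` marks a defaulting customer; the function name and the sample labels are illustrative and do not come from the paper.

```python
def f1_score(y_true, y_pred, positive=1):
    """F1 = 2 * precision * recall / (precision + recall) for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero or undefined, so report 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Illustrative labels: 2 true positives, 1 false positive, 1 false negative.
score = f1_score([1, 0, 1, 1, 0], [1, 0, 0, 1, 1])

# A model that never predicts default is 75% accurate here but has an F1 of 0.
always_negative = f1_score([1, 0, 0, 0], [0, 0, 0, 0])
```

The second call shows why F1 is often preferred over plain accuracy on imbalanced data such as loan defaults.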

The best model achieved an improvement of about 40%, whereas the worst-performing model achieved an improvement of only 3%.

This implies the importance of data engineering (e.g., data preprocessing techniques and feature selection) in machine learning exercises.

American Psychological Association (APA) citation style

al-Qarm, Ahmad & al-Nuaymat, Ghazi & al-Hasan, Mays & al-Dibi, Mutazz Muhammad. 2020. Default prediction model : the significant role of data engineering in the quality of outcomes. The International Arab Journal of Information Technology, Vol. 17, no. 4A (s), pp. 635-644.
https://search.emarefa.net/detail/BIM-1432248

Modern Language Association (MLA) citation style

al-Qarm, Ahmad…[et al.]. Default prediction model : the significant role of data engineering in the quality of outcomes. The International Arab Journal of Information Technology Vol. 17, no. 4A (Special issue) (2020), pp. 635-644.
https://search.emarefa.net/detail/BIM-1432248

American Medical Association (AMA) citation style

al-Qarm, Ahmad & al-Nuaymat, Ghazi & al-Hasan, Mays & al-Dibi, Mutazz Muhammad. Default prediction model : the significant role of data engineering in the quality of outcomes. The International Arab Journal of Information Technology. 2020. Vol. 17, no. 4A (s), pp. 635-644.
https://search.emarefa.net/detail/BIM-1432248

Record type

Articles

Text language

English

Notes

Includes bibliographical references : p. 642-643

Record number

BIM-1432248