Machine learning based intelligent framework for data preprocessing

المؤلفون المشاركون

Sarwar, M. Suhayl
Sufyan, Muhammad
al-Qayyum, Zia
Abd al-Kalim

المصدر

The International Arab Journal of Information Technology

العدد

المجلد 15، العدد 6 (30 نوفمبر/تشرين الثاني 2018)6ص.

الناشر

جامعة الزرقاء

تاريخ النشر

2018-11-30

دولة النشر

الأردن

عدد الصفحات

6

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الملخص EN

Data preprocessing having a pivotal role in data mining ensures reduction in cost by catering inconsistent, incomplete and irrelevant data through data cleansing to assist knowledge workers in making effective decisions through knowledge extraction.

Prevalent techniques are not much effective for having more manual effort, increased processing time, less accuracy percentage etc with constrained data volumes.

In this research, a comprehensive, semi-automatic pre-processing framework based on hybrid of two machine learning techniques namely Conditional Random Fields (CRF) and Hidden Markov Model (HMM) is devised for data cleansing.

Proposed framework is envisaged to be effective and flexible enough to manipulate data set of any size.

A bucket of inconsistent dataset (comprising of customer’s address directory) of Pakistan Telecommunication Company (PTCL) is used to conduct different experiments for training and validation of proposed approach.

Small percentage of semi cleansed data (output of preprocessing) is passed to hybrid of HMM and CRF for learning and rest of the data is used for testing the model.

Experiments depict superiority of higher average accuracy of 95.50% for proposed hybrid approach compared to CRF (84.5%) and HMM (88.6%) when applied in separately

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Sarwar, M. Suhayl& al-Qayyum, Zia& Sufyan, Muhammad& Abd al-Kalim. 2018. Machine learning based intelligent framework for data preprocessing. The International Arab Journal of Information Technology،Vol. 15, no. 6.
https://search.emarefa.net/detail/BIM-873978

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Sarwar, M. Suhayl…[et al.]. Machine learning based intelligent framework for data preprocessing. The International Arab Journal of Information Technology Vol. 15, no. 6 (Nov. 2018).
https://search.emarefa.net/detail/BIM-873978

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Sarwar, M. Suhayl& al-Qayyum, Zia& Sufyan, Muhammad& Abd al-Kalim. Machine learning based intelligent framework for data preprocessing. The International Arab Journal of Information Technology. 2018. Vol. 15, no. 6.
https://search.emarefa.net/detail/BIM-873978

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-873978