An improved algorithm for data preprocessing in mining crime data set

Author

al-Janabi, Kazim Burayhi Sawadi

Source

Journal of Kufa for Mathematics and Computer

Issue

Vol. 1, no. 4 (31 December 2011), pp. 81-87, 7 p.

Publisher

University of Kufa, College of Mathematics and Computer Science

Publication date

2011-12-31

Country of publication

Iraq

Number of pages

7

Main subjects

Mathematics

Topics

Abstract (EN)

This paper presents an improved data preprocessing algorithm that handles missing values and smooths outliers in real-world data sets.

Previous work in this field relies mainly on replacing missing values with the average, the class average, the most common value, or similar techniques, and outliers were generally removed from the data set.

Crime and criminal data sets have their own special characteristics and benchmarks: missing values and outliers carry different meanings than in other fields, so they need to be processed differently.

The algorithm relies mainly on clustering techniques: objects are grouped according to their similarities and dissimilarities, outliers are then smoothed within their clusters, and missing values are filled in according to the clusters they belong to.

WEKA is used as a tool to find different clusters of the criminals.
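
The abstract does not give the details of the algorithm, so the following is only a minimal sketch of the general idea it describes: cluster the records first, then treat missing values and outliers per cluster. Everything specific here is an assumption for illustration: a small k-means written in plain Python stands in for the WEKA clustering step, missing values (`None`) are replaced by the cluster mean, and outliers are smoothed by clipping to the cluster mean plus or minus `z` standard deviations. The function names `distance`, `kmeans`, and `preprocess` are hypothetical and do not come from the paper.

```python
import math
from statistics import mean, pstdev

def distance(row, centroid):
    # Euclidean distance computed over the observed (non-missing) features only.
    return math.sqrt(sum((x - c) ** 2 for x, c in zip(row, centroid) if x is not None))

def kmeans(rows, k, iterations=20):
    # Assumed deterministic initialisation: the first k complete rows are the centroids.
    complete = [r for r in rows if None not in r]
    centroids = [list(r) for r in complete[:k]]
    labels = [0] * len(rows)
    for _ in range(iterations):
        # Assign each row to its nearest centroid.
        labels = [min(range(k), key=lambda j: distance(r, centroids[j])) for r in rows]
        # Recompute each centroid from the observed values of its members.
        for j in range(k):
            for f in range(len(centroids[j])):
                vals = [r[f] for r, lab in zip(rows, labels) if lab == j and r[f] is not None]
                if vals:
                    centroids[j][f] = mean(vals)
    return labels

def preprocess(rows, k=2, z=2.0):
    # Cluster first, then impute and smooth feature by feature inside each cluster.
    labels = kmeans(rows, k)
    cleaned = [list(r) for r in rows]
    for j in range(k):
        members = [i for i, lab in enumerate(labels) if lab == j]
        for f in range(len(rows[0])):
            vals = [rows[i][f] for i in members if rows[i][f] is not None]
            if not vals:
                continue  # nothing observed for this feature in this cluster
            mu, sigma = mean(vals), pstdev(vals)
            low, high = mu - z * sigma, mu + z * sigma
            for i in members:
                v = rows[i][f]
                # Missing value -> cluster mean; observed value -> clipped to [low, high].
                cleaned[i][f] = mu if v is None else min(max(v, low), high)
    return labels, cleaned

# Toy records with two made-up numeric features; None marks a missing value.
records = [[20, 1], [60, 9], [22, None], [58, 10], [21, 2], [61, None]]
labels, cleaned = preprocess(records, k=2)
```

In this toy data the two missing entries are filled from their own cluster rather than from the global average, which is the contrast with the earlier techniques that the abstract draws.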

American Psychological Association (APA) citation style

al-Janabi, Kazim Burayhi Sawadi. 2011. An improved algorithm for data preprocessing in mining crime data set. Journal of Kufa for Mathematics and Computer, Vol. 1, no. 4, pp. 81-87.
https://search.emarefa.net/detail/BIM-307860

Modern Language Association (MLA) citation style

al-Janabi, Kazim Burayhi Sawadi. An improved algorithm for data preprocessing in mining crime data set. Journal of Kufa for Mathematics and Computer Vol. 1, no. 4 (Dec. 2011), pp. 81-87.
https://search.emarefa.net/detail/BIM-307860

American Medical Association (AMA) citation style

al-Janabi, Kazim Burayhi Sawadi. An improved algorithm for data preprocessing in mining crime data set. Journal of Kufa for Mathematics and Computer. 2011. Vol. 1, no. 4, pp. 81-87.
https://search.emarefa.net/detail/BIM-307860

Data type

Articles

Text language

English

Notes

Includes bibliographical references: p. 86-87

Record ID

BIM-307860