A proposed framework for analyzing crime data set using decision tree and simple k-means mining algorithms

Publication Date

2011-05-31

Country of Publication

Iraq

No. of Pages

Main Subjects

Mathematics

Topics

Abstract AR

تقدم هذه الورقة البحثية إطارا و نموذجا لتحليل بيانات الجريمة باستخدام تقنيات و خوارزميات مفاهيم التنقيب عن البيانات (التصنيف و التجميع Classification and Clustering) بهدف تقديم أفضل المعلومات إلى المختصين في علم الجريمة للمساعدة في الكشف عن الجريمة.

يهدف البحث إلى مساعدة الاختصاصين في اكتشاف الأنماط و الاتجاهات للجرائم و المجرمين و إيجاد علاقات و تفسيرات محتملة للجرائم و متابعة الشبكات الإجرامية و تمييز مشتبه بهم محتملين.

إن التصنيف بشكل رئيسي يستند أساسا على تصنيف الجرائم طبقا للنوع، العنوان، وقت حصول الجريمة، صفات المشتبه بهم و غيرها.

إضافة إلى إيجاد العلاقات بين الجرائم المختلفة و الخواص الإجرامية.

و لتحقيق ذلك تم استخدام خوارزميات مختلفة لما يسمى بشجرة القرارات Decision Tree Algorithms لإجراء عملية التصنيف و تقنيات المتوسط البسيط Simple K-Mean للتجميع.

تم جمع البيانات عن الجرائم و المجرمين من البيانات الحرة على الانترنت، حيث استخدمت هذه البيانات لإنشاء و اختبار النموذج المقترح، و قد تم استخدام خوارزميات مختلفة لإعداد هذه البيانات لكي تتلاءم مع خوارزميات التنقيب المختلفة و بعد ذلك تم تطبيق خوارزميات التصنيف و التجميع للحصول على المعلومات التي تساعد في إعطاء رؤية واضحة عن الجرائم و المجرمين.

و قد استخدمت برمجياتExcel و WEKA لمعالجة و تحليل تلك البيانات.

Abstract EN

This paper presents a proposed framework for the crime and criminal data analysis and detection using Decision tree Algorithms for data classification and Simple K Means algorithm for data clustering.

The paper tends to help specialists in discovering patterns and trends, making forecasts, finding relationships and possible explanations mapping criminal networks and identifying possible suspects.

The classification is based mainly on grouping the crimes according to the type, location time and other attributes; Clustering is based on finding relationships between different Crime and Criminal attributes having some previously unknown common characteristics.

The results of both classifications and Clustering are used for prediction of trends and behavior of the given objects (Crimes and Criminals).

Data for both crimes and criminals were collected from free police departments dataset available on the Internet to create and test the proposed framework, and then these data were preprocessed to get clean and accurate data using different preprocessing techniques (cleaning, missing values and removing inconsistency).

The preprocessed data were used to find out different crime and criminal trends and behaviors, and crimes and criminals were grouped into clusters according to their important attributes WEKA mining software and Microsoft Excel were used to analyze the given data.

American Psychological Association (APA)

al-Janabi, Kazim Burayhi Sawadi. 2011. A proposed framework for analyzing crime data set using decision tree and simple k-means mining algorithms. Journal of Kufa for Mathematics and Computer،Vol. 1, no. 3, pp.8-24.
https://search.emarefa.net/detail/BIM-307816

Modern Language Association (MLA)

al-Janabi, Kazim Burayhi Sawadi. A proposed framework for analyzing crime data set using decision tree and simple k-means mining algorithms. Journal of Kufa for Mathematics and Computer Vol. 1, no. 3 (May. 2011), pp.8-24.
https://search.emarefa.net/detail/BIM-307816

American Medical Association (AMA)

al-Janabi, Kazim Burayhi Sawadi. A proposed framework for analyzing crime data set using decision tree and simple k-means mining algorithms. Journal of Kufa for Mathematics and Computer. 2011. Vol. 1, no. 3, pp.8-24.
https://search.emarefa.net/detail/BIM-307816

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 23-24

Record ID

BIM-307816

SaveSaved Print

Arab Citation & Impact Factor "Arcif"

Largest Arabic Database of Citations Analysis for the Arabic Scholarly Journals Issued in Arab World.

eMarefa Indicators
for Arab Scientific Production

"Kashif" for Checking Similarity or Plagiarism in the Arabic Researches. know more