Arabic text classification based on data mining techniques
مقدم أطروحة جامعية
مشرف أطروحة جامعية
Hattab, Izz al-Din Shakir Hasan
أعضاء اللجنة
al-Shaykh, Asim A. R.
al-Sarayirah, Bashshar
al-Lahham, Muhammad Ismail Abd al-Rasul
الجامعة
الأكاديمية العربية للعلوم المالية و المصرفية
الكلية
كلية نظم و تكنولوجيا المعلومات
القسم الأكاديمي
قسم نظم المعلومات الحاسوبية
دولة الجامعة
الأردن
الدرجة العلمية
دكتوراه
تاريخ الدرجة العلمية
2012
الملخص الإنجليزي
Classification using association rule mining also known as associative classification (AC) is a promising approach that usually builds higher accurate classifier than classic classification approaches in data mining, including probabilistic, decision trees and covering.
In this thesis, we study the problem of using AC mining in structured English and unstructured Arabic textual data collections to discover extract moderate size classifiers that end-user can easily control and understand.
The outcome is a new AC algorithm that efficiently finds knowledge (class association rules–CARs) from different classification benchmark problems, together with extensive experimental results.
Precisely, we introduce the―Reduced Classification using Association‖ (RCUA) which contains three new procedures : (1) A rule discovery procedure that uses new data representation method to efficiently discover the CARs (rules) in one data scan.
(2) A classifier building procedure that reduces the size of the classifiers and over fitting by generating a moderate size classification systems where end user can easily maintain and interpret (3) An evaluation procedure that instead of utilizing one rule for class assignment during prediction like CBA it bases its class assignment decision on multiple rules and thus improving the classification accuracy of the resulting classifiers.
During extensive experimentations using a range of real world application data (Arabic and English), we revealed that our algorithm is often more accurate than well-known traditional classification techniques and competitive with popular associative classification algorithms.
التخصصات الرئيسية
تكنولوجيا المعلومات وعلم الحاسوب
الموضوعات
عدد الصفحات
131
قائمة المحتويات
Table of contents.
Abstract.
Chapter One : introduction.
Chapter Two : literature review on text categorization.
Chapter Three : overview of classification based association rule discovery in data mining.
Chapter Four : new classification using association rule algorithm.
Chapter Five : conclusions and further works.
References.
نمط استشهاد جمعية علماء النفس الأمريكية (APA)
al-Jamal, Muhannad Said. (2012). Arabic text classification based on data mining techniques. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences, Jordan
https://search.emarefa.net/detail/BIM-306664
نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)
al-Jamal, Muhannad Said. Arabic text classification based on data mining techniques. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences. (2012).
https://search.emarefa.net/detail/BIM-306664
نمط استشهاد الجمعية الطبية الأمريكية (AMA)
al-Jamal, Muhannad Said. (2012). Arabic text classification based on data mining techniques. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences, Jordan
https://search.emarefa.net/detail/BIM-306664
لغة النص
الإنجليزية
نوع البيانات
رسائل جامعية
رقم السجل
BIM-306664
قاعدة معامل التأثير والاستشهادات المرجعية العربي "ارسيف Arcif"
أضخم قاعدة بيانات عربية للاستشهادات المرجعية للمجلات العلمية المحكمة الصادرة في العالم العربي
تقوم هذه الخدمة بالتحقق من التشابه أو الانتحال في الأبحاث والمقالات العلمية والأطروحات الجامعية والكتب والأبحاث باللغة العربية، وتحديد درجة التشابه أو أصالة الأعمال البحثية وحماية ملكيتها الفكرية. تعرف اكثر