Developing an efficient mechanism for mining association rules in small large transactional databases
Dissertant
Thesis advisor
Comitee Members
al-Shammari, Husayn Hadi Uwayyid
al-Shaykh, Asim A. R.
al-Lahham, Muhammad Ismail Abd al-Rasul
University
Arab Academy for Financial and Banking Sciences
Faculty
The Faculty of Information Systems and Technology
Department
Computer information systems
University Country
Jordan
Degree
Ph.D.
Degree Date
2012
English Abstract
This thesis presents a description of a newly developed high performance association rule mining algorithm that basically combines the well-known Apriori association rule mining algorithm with a compact mechanism, namely, the Dynamic Compact Grouping Item sets (DCGI).
DCGI is specially designed for optimizing online and very large transactional databases.
Therefore, this new algorithm is called Online Dynamic Grouping Association Rule Mining (ODGARM) algorithm.
ODGARM has a number of advantages over existing algorithms, such as: the first to provide efficient and accurate online dynamic grouping association rule mining on transactional databases of various sizes, allow generating frequent patterns quickly by skipping the repetitive database scan and reducing a great amount of time per database scan; reduce the search space (candidate's item sets) and discover only those rules which can be interesting for the user; reduce the memory requirements to store a huge number of useless candidates; and enable good diversification of the search space and in many cases decreases the time needed for the algorithm to generate a set of large frequent item sets.
The main corner stone of ODGARM are: inclusion of DCGI as a compact pre-processing mechanism, penalizing rules with few item sets using weight thresholds, inclusion of a new mechanism supported with developed mathematical model to prevent eliminating good item sets that might be part of the best solutions we are looking for, avoid one-at-a-time transaction processing and retrieval from the database, and avoid spending time improving low starting quality solution.
In order to evaluate and compare the performance of ODGARM with other algorithms in the literature, four scenarios are simulated using synthetic and real datasets.
The first three scenarios investigate the effect of input parameters, such as: minimum support, number of items in item set, and number of database transactions.
The forth scenario presents a comparison between the performance of ODGARM against the performance of other algorithms found in the literature (e.g., Apriori, Apriori-Tad, Apriori Hybrid, FP-Growth, and Partition).
Main Subjects
Information Technology and Computer Science
Topics
No. of Pages
84
Table of Contents
Table of contents.
Abstract.
Chapter One : introduction.
Chapter Two : literature survey and previous work.
Chapter Three : the online dynamic grouping association rule mining (ODGARM) algorithm.
Chapter Four : results and discussions.
Chapter Five : conclusions and recommendations for future work.
References.
American Psychological Association (APA)
al-Sharman, Husam Muhammad. (2012). Developing an efficient mechanism for mining association rules in small large transactional databases. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences, Jordan
https://search.emarefa.net/detail/BIM-306659
Modern Language Association (MLA)
al-Sharman, Husam Muhammad. Developing an efficient mechanism for mining association rules in small large transactional databases. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences. (2012).
https://search.emarefa.net/detail/BIM-306659
American Medical Association (AMA)
al-Sharman, Husam Muhammad. (2012). Developing an efficient mechanism for mining association rules in small large transactional databases. (Doctoral dissertations Theses and Dissertations Master). Arab Academy for Financial and Banking Sciences, Jordan
https://search.emarefa.net/detail/BIM-306659
Language
English
Data Type
Arab Theses
Record ID
BIM-306659