A hybrid data warehouse model to improve mining algorithms
Joint Authors
al-Janabi, Kazim B. S.
Kazim, Rusul Ali
Source
Journal of Kufa for Mathematics and Computer
Issue
Vol. 4, Issue 3 (31 Dec. 2017), pp.21-30, 10 p.
Publisher
University of Kufa Faculty of Mathematics and Computers Science
Publication Date
2017-12-31
Country of Publication
Iraq
No. of Pages
10
Main Subjects
Engineering & Technology Sciences (Multidisciplinary)
Abstract EN
The performance of data mining algorithms, including classification, clustering, association, prediction and others, depends heavily on the approach used in data warehouse design and on the way the data is stored (detailed, lightly summarized or highly summarized). Detailed data is needed to produce detailed reports, but its sheer volume poses a major challenge to the mining algorithms. Summarized data, on the other hand, leads to better algorithm performance, but it may lack required knowledge, which can affect the overall mining process.
Knowledge extraction and the performance and complexity of mining algorithms represent a major challenge in the field of data analysis. This paper therefore proposes an approach to improve algorithm performance through a well-designed warehouse and a data reduction technique.
The paper presents a hybrid warehouse galaxy model that stores data in three formats: detailed, summarized and highly summarized.
Time and space complexity are the major criteria in the proposed approach.
Real data about schools, students and teachers was collected from different AlNajaf AlAshraf cities. The data was preprocessed, reduced mainly through concept hierarchies, and then converted into dimension and fact tables (Warehouse Galaxy Model), which were in turn converted into multidimensional cubes.
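A minimal sketch of data reduction through a concept hierarchy, under stated assumptions: the hierarchy (school, district, city), the record layout and all values below are hypothetical and do not come from the paper's data set. It only illustrates how detailed rows can be generalized into summarized and highly summarized fact rows.

```python
from collections import defaultdict

# Hypothetical concept hierarchy: school -> district -> city.
SCHOOL_TO_DISTRICT = {
    "School A": "District 1",
    "School B": "District 1",
    "School C": "District 2",
}
DISTRICT_TO_CITY = {"District 1": "City X", "District 2": "City X"}

# Hypothetical detailed fact rows: (school, grade, student_count).
detailed = [
    ("School A", 1, 30),
    ("School A", 2, 28),
    ("School B", 1, 25),
    ("School C", 1, 40),
]


def generalize(rows, mapping):
    """Replace the first attribute with its parent concept and sum the measure."""
    totals = defaultdict(int)
    for key, grade, count in rows:
        totals[(mapping[key], grade)] += count
    return [(parent, grade, count) for (parent, grade), count in sorted(totals.items())]


# Each step climbs one level of the hierarchy and shrinks the table.
summarized = generalize(detailed, SCHOOL_TO_DISTRICT)
highly_summarized = generalize(summarized, DISTRICT_TO_CITY)

print(len(detailed), len(summarized), len(highly_summarized))
```

Keeping all three tables side by side is one way to read the hybrid idea: coarse questions can be answered from the smallest table, while the detailed rows remain available when they are needed.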
Roll-up and drill-down queries were used extensively to get the required information.
The resulting data cubes, and in turn the corresponding warehouse model presented in this work, showed a reasonable improvement in the performance of knowledge extraction algorithms for the data under discussion.
The roll-up and drill-down queries performed better than queries over the detailed data.
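The roll-up and drill-down operations mentioned in the abstract can be illustrated with a small sketch. The dimension names, fact rows and helper functions here are assumptions made for illustration, not the authors' schema or implementation.

```python
from collections import defaultdict

# Hypothetical dimensions and fact rows: (city, school, year, students).
DIMS = ("city", "school", "year")
FACTS = [
    ("City X", "School A", 2015, 300),
    ("City X", "School A", 2016, 320),
    ("City X", "School B", 2015, 250),
    ("City Y", "School C", 2015, 400),
    ("City Y", "School C", 2016, 410),
]


def aggregate(facts, keep):
    """Group by the dimensions named in `keep` and sum the measure."""
    idx = [DIMS.index(d) for d in keep]
    cube = defaultdict(int)
    for row in facts:
        cube[tuple(row[i] for i in idx)] += row[-1]
    return dict(cube)


def drill_down(facts, keep, **fixed):
    """Fix some dimension values, then aggregate at a finer level."""
    selected = [
        row for row in facts
        if all(row[DIMS.index(d)] == v for d, v in fixed.items())
    ]
    return aggregate(selected, keep)


# Roll-up: drop the school dimension, then the year dimension.
by_city_year = aggregate(FACTS, ("city", "year"))
by_city = aggregate(FACTS, ("city",))

# Drill-down: return to school and year detail for one city.
city_x_detail = drill_down(FACTS, ("school", "year"), city="City X")
```

If the rolled-up results are stored in advance, a query at city level reads two rows instead of five, which is the kind of saving the abstract attributes to summarized storage.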
American Psychological Association (APA)
al-Janabi, Kazim B. S. & Kazim, Rusul Ali. 2017. A hybrid data warehouse model to improve mining algorithms. Journal of Kufa for Mathematics and Computer, Vol. 4, no. 3, pp. 21-30.
https://search.emarefa.net/detail/BIM-834704
Modern Language Association (MLA)
al-Janabi, Kazim B. S. & Kazim, Rusul Ali. A hybrid data warehouse model to improve mining algorithms. Journal of Kufa for Mathematics and Computer Vol. 4, no. 3 (Dec. 2017), pp. 21-30.
https://search.emarefa.net/detail/BIM-834704
American Medical Association (AMA)
al-Janabi, Kazim B. S. & Kazim, Rusul Ali. A hybrid data warehouse model to improve mining algorithms. Journal of Kufa for Mathematics and Computer. 2017. Vol. 4, no. 3, pp. 21-30.
https://search.emarefa.net/detail/BIM-834704
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references : p. 30
Record ID
BIM-834704