A hybrid data warehouse model to improve mining algorithms

Joint Authors

al-Janabi, Kazim B. S.
Kazim, Rusul Ali

Source

Journal of Kufa for Mathematics and Computer

Issue

Vol. 4, Issue 3 (31 Dec. 2017), pp.21-30, 10 p.

Publisher

University of Kufa Faculty of Mathematics and Computers Science

Publication Date

2017-12-31

Country of Publication

Iraq

No. of Pages

10

Main Subjects

Engineering & Technology Sciences (Multidisciplinary)

Abstract EN

The performance of different Data Mining Algorithms including Classification, Clustering, Association, Prediction and others are highly related to the approaches used in Data Warehouse design and to the way the data is stored (lightly summarized, highly summarized and detailed).Detailed data is important to get detailed reports but as the amount of data is huge this represents a big challenge to the mining algorithms, on the other hand, the summarized data leads to better algorithms performance but the lack of the required knowledge may affect the overall mining process.

Knowledge extraction and mining algorithms performance and complexities represent a big challenge in data analysis field, hence the work in this paper represents a proposed approach to improve the algorithms performance throughout well designed warehouse and data reduction technique.

The work in this paper presents a hybrid warehouse galaxy model that stores data in three different formats including detailed, summarized and highly summarized data.

The time and space complexity are the major criteria in the proposed approach.

Real data was collected about schools, students and teachers from different AlNajaf AlAshraf cities, the data was preprocessed, reduced mainly through concept hierarchy and then converted into dimensions and fact tables (Warehouse Galaxy Model) which in turn are converted into multidimensional cubes.

Roll up and drill down queries were highly used to get the required information.

The resultant data cubes and in turn the corresponding warehouse model presented in this work showed a reasonable improvement in knowledge extraction algorithms for the data under discussion.

The results of the queries showed better performance of different roll up and drill down queries compared to detailed data queries.

American Psychological Association (APA)

al-Janabi, Kazim B. S.& Kazim, Rusul Ali. 2017. A hybrid data warehouse model to improve mining algorithms. Journal of Kufa for Mathematics and Computer،Vol. 4, no. 3, pp.21-30.
https://search.emarefa.net/detail/BIM-834704

Modern Language Association (MLA)

al-Janabi, Kazim B. S.& Kazim, Rusul Ali. A hybrid data warehouse model to improve mining algorithms. Journal of Kufa for Mathematics and Computer Vol. 4, no. 3 (Dec. 2017), pp.21-30.
https://search.emarefa.net/detail/BIM-834704

American Medical Association (AMA)

al-Janabi, Kazim B. S.& Kazim, Rusul Ali. A hybrid data warehouse model to improve mining algorithms. Journal of Kufa for Mathematics and Computer. 2017. Vol. 4, no. 3, pp.21-30.
https://search.emarefa.net/detail/BIM-834704

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references : p. 30

Record ID

BIM-834704