CFD_mine : discovery of conditional functional dependencies in relational databases
Dissertant
Hakawati, Muhammad Raghib Muhammad
Thesis advisor
Comitee Members
Shilbayah, Nidal
al-Atum, Jalal
Mamluk, Rustum
University
Middle East University
Faculty
Faculty of Information Technology
Department
Computer Science Department
University Country
Jordan
Degree
Master
Degree Date
2009
English Abstract
Dirty data (i.e.
containing inconsistences, conflict and errors) is a serious problem for many organizations leading to incorrect decision making, inefficient daily operations, and ultimately wasting both time and money.
Dirty data in a database often emerge as violation of integrity constraints, meant to preserve data consistency and accuracy.
Conditional Functional Dependencies (CFDs) have recently been introduced for data cleaning.
CFDs extends Functional Dependencies (FDs) by enforcing patterns of semantically related values , and have proved more effective in catching data inconsistencies than FDs , which were currently the basis of many data-Cleaning tools Discovery of CFDs existing in an instance of a relation is an expensive process that involves intensive manual effort.
In this thesis, the researcher develops an effective algorithm, called CFD_Mine for discovering CFDs in a relation instance.
CFD_Mine is a Levelwise algorithm that extends TANE, a well-known algorithm for discovering FDs.
it searches for minimal CFDs among the data values and prunes redundant candidates.
An experimental study is presented for showing the scalability of our algorithm .Finally the results show that CFD_Mine works well when a given sample relation is large and scales well will the arity of the relation.
Main Subjects
Business Administration
Information Technology and Computer Science
Topics
No. of Pages
54
Table of Contents
Table of contents.
Abstract.
Abstract in Arabic.
Chapter One : Introduction.
Chapter Two : Literature review.
Chapter Three : CFD-mine algorithm.
Chapter Four : Experimental evaluation.
Chapter Five : Conclusions and future work.
References.
American Psychological Association (APA)
Hakawati, Muhammad Raghib Muhammad. (2009). CFD_mine : discovery of conditional functional dependencies in relational databases. (Master's theses Theses and Dissertations Master). Middle East University, Jordan
https://search.emarefa.net/detail/BIM-698627
Modern Language Association (MLA)
Hakawati, Muhammad Raghib Muhammad. CFD_mine : discovery of conditional functional dependencies in relational databases. (Master's theses Theses and Dissertations Master). Middle East University. (2009).
https://search.emarefa.net/detail/BIM-698627
American Medical Association (AMA)
Hakawati, Muhammad Raghib Muhammad. (2009). CFD_mine : discovery of conditional functional dependencies in relational databases. (Master's theses Theses and Dissertations Master). Middle East University, Jordan
https://search.emarefa.net/detail/BIM-698627
Language
English
Data Type
Arab Theses
Record ID
BIM-698627