CFD_mine : discovery of conditional functional dependencies in relational databases

Dissertant

Hakawati, Muhammad Raghib Muhammad

Thesis advisor

Aqil, Misbah Jumah

Comitee Members

Shilbayah, Nidal
al-Atum, Jalal
Mamluk, Rustum

University

Middle East University

Faculty

Faculty of Information Technology

Department

Computer Science Department

University Country

Jordan

Degree

Master

Degree Date

2009

English Abstract

Dirty data (i.e.

containing inconsistences, conflict and errors) is a serious problem for many organizations leading to incorrect decision making, inefficient daily operations, and ultimately wasting both time and money.

Dirty data in a database often emerge as violation of integrity constraints, meant to preserve data consistency and accuracy.

Conditional Functional Dependencies (CFDs) have recently been introduced for data cleaning.

CFDs extends Functional Dependencies (FDs) by enforcing patterns of semantically related values , and have proved more effective in catching data inconsistencies than FDs , which were currently the basis of many data-Cleaning tools Discovery of CFDs existing in an instance of a relation is an expensive process that involves intensive manual effort.

In this thesis, the researcher develops an effective algorithm, called CFD_Mine for discovering CFDs in a relation instance.

CFD_Mine is a Levelwise algorithm that extends TANE, a well-known algorithm for discovering FDs.

it searches for minimal CFDs among the data values and prunes redundant candidates.

An experimental study is presented for showing the scalability of our algorithm .Finally the results show that CFD_Mine works well when a given sample relation is large and scales well will the arity of the relation.

Main Subjects

Business Administration
Information Technology and Computer Science

Topics

No. of Pages

54

Table of Contents

Table of contents.

Abstract.

Abstract in Arabic.

Chapter One : Introduction.

Chapter Two : Literature review.

Chapter Three : CFD-mine algorithm.

Chapter Four : Experimental evaluation.

Chapter Five : Conclusions and future work.

References.

American Psychological Association (APA)

Hakawati, Muhammad Raghib Muhammad. (2009). CFD_mine : discovery of conditional functional dependencies in relational databases. (Master's theses Theses and Dissertations Master). Middle East University, Jordan
https://search.emarefa.net/detail/BIM-698627

Modern Language Association (MLA)

Hakawati, Muhammad Raghib Muhammad. CFD_mine : discovery of conditional functional dependencies in relational databases. (Master's theses Theses and Dissertations Master). Middle East University. (2009).
https://search.emarefa.net/detail/BIM-698627

American Medical Association (AMA)

Hakawati, Muhammad Raghib Muhammad. (2009). CFD_mine : discovery of conditional functional dependencies in relational databases. (Master's theses Theses and Dissertations Master). Middle East University, Jordan
https://search.emarefa.net/detail/BIM-698627

Language

English

Data Type

Arab Theses

Record ID

BIM-698627