A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning

Co-authors

Biuk-Aghai, Robert P.
Si, Yain-whar
Yap, Bee Wah
Fong, Simon

Source

Mathematical Problems in Engineering

Issue

Vol. 2015, Issue 2015 (31 December 2015), pp. 1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2015-03-01

Country of Publication

Egypt

Number of Pages

11

Main Subjects

Civil Engineering

Abstract EN

A prime objective in constructing data stream mining models is to achieve good accuracy, fast learning, and robustness to noise.

Although many techniques have been proposed in the past, efforts to improve the accuracy of classification models have been somewhat disparate.

These techniques include, but are not limited to, feature selection, dimensionality reduction, and the removal of noise from training data.

One limitation common to all of these techniques is the assumption that the full training dataset must be applied.

Although this has been effective for traditional batch training, it may not be practical for incremental classifier learning, also known as data stream mining, where only a single pass of the data stream is seen at a time.

Because data streams are potentially unbounded, as seen in the so-called big data phenomenon, the data preprocessing time must be kept to a minimum.

This paper introduces a new data preprocessing strategy suitable for the progressive purging of noisy data from the training dataset without the need to process the whole dataset at one time.

This strategy is shown via a computer simulation to provide the significant benefit of allowing for the dynamic removal of bad records from the incremental classifier learning process.

American Psychological Association (APA) Citation Style

Fong, Simon & Biuk-Aghai, Robert P. & Si, Yain-whar & Yap, Bee Wah. 2015. A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning. Mathematical Problems in Engineering, Vol. 2015, no. 2015, pp. 1-11.
https://search.emarefa.net/detail/BIM-1072940

Modern Language Association (MLA) Citation Style

Fong, Simon…[et al.]. A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning. Mathematical Problems in Engineering No. 2015 (2015), pp. 1-11.
https://search.emarefa.net/detail/BIM-1072940

American Medical Association (AMA) Citation Style

Fong, Simon & Biuk-Aghai, Robert P. & Si, Yain-whar & Yap, Bee Wah. A Lightweight Data Preprocessing Strategy with Fast Contradiction Analysis for Incremental Classifier Learning. Mathematical Problems in Engineering. 2015. Vol. 2015, no. 2015, pp. 1-11.
https://search.emarefa.net/detail/BIM-1072940

Data Type

Articles

Text Language

English

Notes

Includes bibliographical references

Record ID

BIM-1072940