An Incremental Kernel Density Estimator for Data Stream Computation

Joint Authors

He, Yulin
Jiang, Jie
Dai, Dexin
Fabrice, Klohoun

Source

Complexity

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-17, 17 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-02-21

Country of Publication

Egypt

No. of Pages

17

Main Subjects

Philosophy

Abstract EN

Probability density function (p.d.f.) estimation plays a very important role in the field of data mining.

Kernel density estimator (KDE) is the mostly used technology to estimate the unknown p.d.f.

for the given dataset.

The existing KDEs are usually inefficient when handling the p.d.f.

estimation problem for stream data because a bran-new KDE has to be retrained based on the combination of current data and newly coming data.

This process increases the training time and wastes the computation resource.

This article proposes an incremental kernel density estimator (I-KDE) which deals with the p.d.f.

estimation problem in the way of data stream computation.

The I-KDE updates the current KDE dynamically and gradually with the newly coming data rather than retraining the bran-new KDE with the combination of current data and newly coming data.

The theoretical analysis proves the convergence of the I-KDE only if the estimated p.d.f.

of newly coming data is convergent to its true p.d.f.

In order to guarantee the convergence of the I-KDE, a new multivariate fixed-point iteration algorithm based on the unbiased cross validation (UCV) method is developed to determine the optimal bandwidth of the KDE.

The experimental results on 10 univariate and 4 multivariate probability distributions demonstrate the feasibility and effectiveness of the I-KDE.

American Psychological Association (APA)

He, Yulin& Jiang, Jie& Dai, Dexin& Fabrice, Klohoun. 2020. An Incremental Kernel Density Estimator for Data Stream Computation. Complexity،Vol. 2020, no. 2020, pp.1-17.
https://search.emarefa.net/detail/BIM-1139962

Modern Language Association (MLA)

He, Yulin…[et al.]. An Incremental Kernel Density Estimator for Data Stream Computation. Complexity No. 2020 (2020), pp.1-17.
https://search.emarefa.net/detail/BIM-1139962

American Medical Association (AMA)

He, Yulin& Jiang, Jie& Dai, Dexin& Fabrice, Klohoun. An Incremental Kernel Density Estimator for Data Stream Computation. Complexity. 2020. Vol. 2020, no. 2020, pp.1-17.
https://search.emarefa.net/detail/BIM-1139962

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1139962