Stratification-Based Outlier Detection over the Deep Web

Joint Authors

Cui, Zhiming
Zhao, Pengpeng
Xian, Xuefeng
Sheng, Victor S.
Fang, Ligang
Gu, Caidong
Yang, Yuanfeng

Source

Computational Intelligence and Neuroscience

Issue

Vol. 2016, Issue 2016 (31 Dec. 2015), pp.1-13, 13 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2016-05-25

Country of Publication

Egypt

No. of Pages

13

Main Subjects

Biology

Abstract EN

For many applications, finding rare instances or outliers can be more interesting than finding common patterns.

Existing work in outlier detection never considers the context of deep web.

In this paper, we argue that, for many scenarios, it is more meaningful to detect outliers over deep web.

In the context of deep web, users must submit queries through a query interface to retrieve corresponding data.

Therefore, traditional data mining methods cannot be directly applied.

The primary contribution of this paper is to develop a new data mining method for outlier detection over deep web.

In our approach, the query space of a deep web data source is stratified based on a pilot sample.

Neighborhood sampling and uncertainty sampling are developed in this paper with the goal of improving recall and precision based on stratification.

Finally, a careful performance evaluation of our algorithm confirms that our approach can effectively detect outliers in deep web.

American Psychological Association (APA)

Xian, Xuefeng& Zhao, Pengpeng& Sheng, Victor S.& Fang, Ligang& Gu, Caidong& Yang, Yuanfeng…[et al.]. 2016. Stratification-Based Outlier Detection over the Deep Web. Computational Intelligence and Neuroscience،Vol. 2016, no. 2016, pp.1-13.
https://search.emarefa.net/detail/BIM-1099752

Modern Language Association (MLA)

Xian, Xuefeng…[et al.]. Stratification-Based Outlier Detection over the Deep Web. Computational Intelligence and Neuroscience Vol. 2016, no. 2016 (2015), pp.1-13.
https://search.emarefa.net/detail/BIM-1099752

American Medical Association (AMA)

Xian, Xuefeng& Zhao, Pengpeng& Sheng, Victor S.& Fang, Ligang& Gu, Caidong& Yang, Yuanfeng…[et al.]. Stratification-Based Outlier Detection over the Deep Web. Computational Intelligence and Neuroscience. 2016. Vol. 2016, no. 2016, pp.1-13.
https://search.emarefa.net/detail/BIM-1099752

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1099752