Selectivity estimation of range queries in data streams using micro-clustering

المؤلفون المشاركون

Gupta, Sudhanshu
Garg, Deepak

المصدر

The International Arab Journal of Information Technology

العدد

المجلد 13، العدد 4 (31 يوليو/تموز 2016)8ص.

الناشر

جامعة الزرقاء

تاريخ النشر

2016-07-31

دولة النشر

الأردن

عدد الصفحات

8

التخصصات الرئيسية

تكنولوجيا المعلومات وعلم الحاسوب

الموضوعات

الملخص EN

estimation is an important task for query optimization.

The common data mining techniques are not applicable on large, fast and continuous data streams as they require one pass processing of data.

These requirements make range query estimation a challenging task.

We propose a technique to perform range query estimation using micro-clustering.

The technique maintains cluster statistics in terms of micro-clusters.

These micro-clusters also maintain data distribution information of the cluster values using cosine coefficients.

These cosine coefficients are used for estimating range queries.

The estimation can be done over a range of data values spread over a number of clusters.

The technique has been compared with cosine series technique for selectivity estimation.

Experiments have been conducted on both synthetic and real datasets of varying sizes and results confirm that our technique offers substantial improvements in accuracy over other methods.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Gupta, Sudhanshu& Garg, Deepak. 2016. Selectivity estimation of range queries in data streams using micro-clustering. The International Arab Journal of Information Technology،Vol. 13, no. 4.
https://search.emarefa.net/detail/BIM-654964

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Gupta, Sudhanshu& Garg, Deepak. Selectivity estimation of range queries in data streams using micro-clustering. The International Arab Journal of Information Technology Vol. 13, no. 4 (Jul. 2016).
https://search.emarefa.net/detail/BIM-654964

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Gupta, Sudhanshu& Garg, Deepak. Selectivity estimation of range queries in data streams using micro-clustering. The International Arab Journal of Information Technology. 2016. Vol. 13, no. 4.
https://search.emarefa.net/detail/BIM-654964

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes appendices.

رقم السجل

BIM-654964