Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets

المؤلفون المشاركون

Leng, Mingwei
Cheng, Jianjun
Wang, Jinjin
Zhang, Zhengquan
Zhou, Hanhai
Chen, Xiaoyun

المصدر

Mathematical Problems in Engineering

العدد

المجلد 2013، العدد 2013 (31 ديسمبر/كانون الأول 2013)، ص ص. 1-10، 10ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2013-11-26

دولة النشر

مصر

عدد الصفحات

10

التخصصات الرئيسية

هندسة مدنية

الملخص EN

The accuracy of most of the existing semisupervised clustering algorithms based on small size of labeled dataset is low when dealing with multidensity and imbalanced datasets, and labeling data is quite expensive and time consuming in many real-world applications.

This paper focuses on active data selection and semisupervised clustering algorithm in multidensity and imbalanced datasets and proposes an active semisupervised clustering algorithm.

The proposed algorithm uses an active mechanism for data selection to minimize the amount of labeled data, and it utilizes multithreshold to expand labeled datasets on multidensity and imbalanced datasets.

Three standard datasets and one synthetic dataset are used to demonstrate the proposed algorithm, and the experimental results show that the proposed semisupervised clustering algorithm has a higher accuracy and a more stable performance in comparison to other clustering and semisupervised clustering algorithms, especially when the datasets are multidensity and imbalanced.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Leng, Mingwei& Cheng, Jianjun& Wang, Jinjin& Zhang, Zhengquan& Zhou, Hanhai& Chen, Xiaoyun. 2013. Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets. Mathematical Problems in Engineering،Vol. 2013, no. 2013, pp.1-10.
https://search.emarefa.net/detail/BIM-1010197

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Leng, Mingwei…[et al.]. Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets. Mathematical Problems in Engineering No. 2013 (2013), pp.1-10.
https://search.emarefa.net/detail/BIM-1010197

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Leng, Mingwei& Cheng, Jianjun& Wang, Jinjin& Zhang, Zhengquan& Zhou, Hanhai& Chen, Xiaoyun. Active Semisupervised Clustering Algorithm with Label Propagation for Imbalanced and Multidensity Datasets. Mathematical Problems in Engineering. 2013. Vol. 2013, no. 2013, pp.1-10.
https://search.emarefa.net/detail/BIM-1010197

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1010197