A Fast Clustering Algorithm for Data with a Few Labeled Instances

Co-authors

Yang, Jinfeng
Xiao, Yong
Wang, Jiabing
Ma, Qianli
Shen, Yanhua

Source

Computational Intelligence and Neuroscience

Issue

Vol. 2015, no. 2015 (31 December 2015), pp. 1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication date

2015-03-11

Country of publication

Egypt

Number of pages

10

Main subjects

Biology

Abstract (EN)

The diameter of a cluster is the maximum intracluster distance between pairs of instances within the same cluster, and the split of a cluster is the minimum distance between instances within the cluster and instances outside the cluster.
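As a minimal sketch of these two definitions, the following computes the diameter and the split of one cluster. It assumes Euclidean distance and uses hypothetical toy points; the function names `diameter` and `split` are illustrative and do not come from the paper.

```python
from itertools import combinations
from math import dist  # Euclidean distance (an assumption; any metric could be used)


def diameter(cluster):
    """Maximum pairwise distance within a cluster (0.0 for a singleton)."""
    return max((dist(p, q) for p, q in combinations(cluster, 2)), default=0.0)


def split(cluster, others):
    """Minimum distance between a point in the cluster and a point outside it."""
    return min(dist(p, q) for p in cluster for q in others)


# Hypothetical toy data: two well-separated groups on a line.
a = [(0.0, 0.0), (1.0, 0.0)]
b = [(5.0, 0.0), (6.0, 0.0)]

diam_a = diameter(a)   # farthest pair inside a
split_a = split(a, b)  # closest pair straddling a and b
```

In this toy case the split of `a` is larger than its diameter, which is the kind of separation the abstract's ratio condition describes.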

Given a few labeled instances, this paper makes two contributions.

First, we present a simple and fast clustering algorithm with the following property: if the ratio of the minimum split to the maximum diameter (RSD) of the optimal solution is greater than one, the algorithm returns optimal solutions for three clustering criteria.
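One way to write the ratio formally is sketched below. The notation is an assumption introduced here for illustration: a clustering \mathcal{C} = \{C_1, \dots, C_k\} of an instance set X under a distance d.

```latex
\operatorname{diam}(C_i) = \max_{x,\, y \in C_i} d(x, y), \qquad
\operatorname{split}(C_i) = \min_{x \in C_i,\; y \in X \setminus C_i} d(x, y), \qquad
\mathrm{RSD}(\mathcal{C}) = \frac{\min_{i} \operatorname{split}(C_i)}{\max_{i} \operatorname{diam}(C_i)}
```

Under this reading, RSD greater than one means that every distance between instances in different clusters exceeds every distance between instances in the same cluster.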

Second, we study the metric learning problem: learn a distance metric to make the RSD as large as possible.

Compared with existing metric learning algorithms, one of our metric learning algorithms is computationally efficient: it is a linear programming model rather than the semidefinite programming model used by most existing algorithms.

We demonstrate empirically that the supervision and the learned metric can improve the clustering quality.

American Psychological Association (APA) citation style

Yang, Jinfeng & Xiao, Yong & Wang, Jiabing & Ma, Qianli & Shen, Yanhua. 2015. A Fast Clustering Algorithm for Data with a Few Labeled Instances. Computational Intelligence and Neuroscience, Vol. 2015, no. 2015, pp. 1-10.
https://search.emarefa.net/detail/BIM-1057672

Modern Language Association (MLA) citation style

Yang, Jinfeng…[et al.]. A Fast Clustering Algorithm for Data with a Few Labeled Instances. Computational Intelligence and Neuroscience No. 2015 (2015), pp.1-10.
https://search.emarefa.net/detail/BIM-1057672

American Medical Association (AMA) citation style

Yang, Jinfeng & Xiao, Yong & Wang, Jiabing & Ma, Qianli & Shen, Yanhua. A Fast Clustering Algorithm for Data with a Few Labeled Instances. Computational Intelligence and Neuroscience. 2015. Vol. 2015, no. 2015, pp. 1-10.
https://search.emarefa.net/detail/BIM-1057672

Data type

Articles

Text language

English

Notes

Includes bibliographical references

Record number

BIM-1057672