A method for finding the appropriate number of clusters

Joint Authors

Doan, Huan
Nguyen, Dinh Thuan

Source

The International Arab Journal of Information Technology

Issue

Vol. 15, Issue 4 (31 Jul. 2018)8 p.

Publisher

Zarqa University

Publication Date

2018-07-31

Country of Publication

Jordan

No. of Pages

8

Main Subjects

Information Technology and Computer Science

Abstract EN

Drawback of almost partition based clustering algorithms is the requirement for the number of clusters specified at the beginning.

Identifying the true number of clusters at the beginning is a difficult problem.

So far, there were some works studied on this issue but no method is perfect in every case.

This paper proposes a method to find the appropriate number of clusters in the clustering process by making an index indicated the appropriate number of clusters.

This index is built from the intra-cluster coefficient and inter-cluster coefficient.

The intra-cluster coefficient reflects intra-distortion of the cluster.

The inter-cluster coefficient reflects the distance among clusters.

Those coefficients are made only by extremely marginal objects of clusters.

The looking for the extremely marginal objects and the building of the index are integrated in a weighted FCM algorithm and it is calculated suitably while the weighted FCM is processing.

The Extended weighted FCM algorithm integrated this index is called FCM-E.

Not only does the FCM-E seek the clusters, but it also finds the appropriate number of clusters.

The authors experiment with the FCM-E on some data sets of UCI: Iris, Wine, Breast Cancer Wisconsin, and Glass and compare the results of the proposed method with the results of the other methods.

The results of proposed method obtained are encouraging

American Psychological Association (APA)

Doan, Huan& Nguyen, Dinh Thuan. 2018. A method for finding the appropriate number of clusters. The International Arab Journal of Information Technology،Vol. 15, no. 4.
https://search.emarefa.net/detail/BIM-839054

Modern Language Association (MLA)

Doan, Huan& Nguyen, Dinh Thuan. A method for finding the appropriate number of clusters. The International Arab Journal of Information Technology Vol. 15, no. 4 (Jul. 2018).
https://search.emarefa.net/detail/BIM-839054

American Medical Association (AMA)

Doan, Huan& Nguyen, Dinh Thuan. A method for finding the appropriate number of clusters. The International Arab Journal of Information Technology. 2018. Vol. 15, no. 4.
https://search.emarefa.net/detail/BIM-839054

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-839054