Prediction of Breast Cancer from Imbalance Respect Using Cluster-Based Undersampling Method

Joint Authors

Chen, Li
Zhang, Jue
Abid, Fazeel

Source

Journal of Healthcare Engineering

Issue

Vol. 2019, Issue 2019 (31 Dec. 2019), pp.1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2019-10-16

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Public Health
Medicine

Abstract EN

To overcome the two-class imbalanced problem existing in the diagnosis of breast cancer, a hybrid of K-means and Boosted C5.0 (K-Boosted C5.0) is proposed which is based on undersampling.

K-means is utilized to select the informative samples near the boundary.

During the training phase, the K-means algorithm clusters the majority and minority instances and selects a similar number of instances from each cluster.

Boosted C5.0 is then used as the classifier.

As there is one different instance selection factor via clustering that encourages the diversity of the training subspace in K-Boosted C5.0, it would be a great advantage to get better performance.

To test the performance of the new hybrid classifier, it is implemented on 12 small-scale and 2 large-scale datasets, which are the often used datasets in class imbalanced learning.

The extensive experimental results show that our proposed hybrid method outperforms most of the competitive algorithms in terms of Matthews’ correlation coefficient (MCC) and accuracy indices.

It can be a good alternative to the well-known machine learning methods.

American Psychological Association (APA)

Zhang, Jue& Chen, Li& Abid, Fazeel. 2019. Prediction of Breast Cancer from Imbalance Respect Using Cluster-Based Undersampling Method. Journal of Healthcare Engineering،Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1175340

Modern Language Association (MLA)

Zhang, Jue…[et al.]. Prediction of Breast Cancer from Imbalance Respect Using Cluster-Based Undersampling Method. Journal of Healthcare Engineering No. 2019 (2019), pp.1-10.
https://search.emarefa.net/detail/BIM-1175340

American Medical Association (AMA)

Zhang, Jue& Chen, Li& Abid, Fazeel. Prediction of Breast Cancer from Imbalance Respect Using Cluster-Based Undersampling Method. Journal of Healthcare Engineering. 2019. Vol. 2019, no. 2019, pp.1-10.
https://search.emarefa.net/detail/BIM-1175340

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1175340