K Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data

المؤلفون المشاركون

Zare, Najaf
Pourahmad, Saeedeh
Raeisi Shahraki, Hadi

المصدر

BioMed Research International

العدد

المجلد 2017، العدد 2017 (31 ديسمبر/كانون الأول 2017)، ص ص. 1-9، 9ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2017-12-11

دولة النشر

مصر

عدد الصفحات

9

التخصصات الرئيسية

الطب البشري

الملخص EN

K nearest neighbors (KNN) are known as one of the simplest nonparametric classifiers but in high dimensional setting accuracy of KNN are affected by nuisance features.

In this study, we proposed the K important neighbors (KIN) as a novel approach for binary classification in high dimensional problems.

To avoid the curse of dimensionality, we implemented smoothly clipped absolute deviation (SCAD) logistic regression at the initial stage and considered the importance of each feature in construction of dissimilarity measure with imposing features contribution as a function of SCAD coefficients on Euclidean distance.

The nature of this hybrid dissimilarity measure, which combines information of both features and distances, enjoys all good properties of SCAD penalized regression and KNN simultaneously.

In comparison to KNN, simulation studies showed that KIN has a good performance in terms of both accuracy and dimension reduction.

The proposed approach was found to be capable of eliminating nearly all of the noninformative features because of utilizing oracle property of SCAD penalized regression in the construction of dissimilarity measure.

In very sparse settings, KIN also outperforms support vector machine (SVM) and random forest (RF) as the best classifiers.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Raeisi Shahraki, Hadi& Pourahmad, Saeedeh& Zare, Najaf. 2017. K Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data. BioMed Research International،Vol. 2017, no. 2017, pp.1-9.
https://search.emarefa.net/detail/BIM-1138672

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Raeisi Shahraki, Hadi…[et al.]. K Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data. BioMed Research International No. 2017 (2017), pp.1-9.
https://search.emarefa.net/detail/BIM-1138672

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Raeisi Shahraki, Hadi& Pourahmad, Saeedeh& Zare, Najaf. K Important Neighbors: A Novel Approach to Binary Classification in High Dimensional Data. BioMed Research International. 2017. Vol. 2017, no. 2017, pp.1-9.
https://search.emarefa.net/detail/BIM-1138672

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1138672