An ensemble multi-label feature selection algorithm based on information entropy

Co-authors

Li, Shining
Zhang, Zhenhai
Duan, Jiaqi

Source

The International Arab Journal of Information Technology

Issue

Vol. 11, No. 4 (31 July 2014), 8 p.

Publisher

Zarqa University

Publication Date

2014-07-31

Country of Publication

Jordan

Number of Pages

8

Main Subjects

Information Technology and Computer Science

Abstract EN

In multi-label classification, feature selection can remove redundant and irrelevant features, which makes the classifiers faster and improves their prediction performance.

Currently, most feature selection algorithms in multi-label classification depend on a concrete classifier, which leads to high computational complexity.

Hence this paper proposes an ensemble multi-label feature selection algorithm based on information entropy (EMFSIE), which is independent of any concrete classifier.

Its core idea has two parts: 1) we employ information gain to evaluate the correlation between each feature and the label set; 2) to select useful features more effectively, we calculate the information gain in an ensemble framework and keep the useful features according to a threshold value determined by the effective factor.
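The abstract does not give the exact formulas, so the following is only a minimal sketch of information-gain scoring for discrete features, not the EMFSIE algorithm itself. Summing the per-label gains into one score and using a fixed `threshold` are illustrative assumptions; the ensemble step and the effective factor are not reproduced, and the function names and toy data are hypothetical.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(values)
    counts = Counter(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature, label):
    """IG(label; feature) = H(label) - H(label | feature) for discrete data."""
    total = len(feature)
    groups = {}
    for f, y in zip(feature, label):
        groups.setdefault(f, []).append(y)
    # Conditional entropy: entropy of the label within each feature value,
    # weighted by how often that feature value occurs.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(label) - conditional


def select_features(X, Y, threshold):
    """Return indices of feature columns whose summed gain over all labels
    reaches `threshold` (an assumed scoring rule, not the paper's)."""
    n_features = len(X[0])
    n_labels = len(Y[0])
    selected = []
    for j in range(n_features):
        column = [row[j] for row in X]
        score = sum(
            information_gain(column, [row[k] for row in Y])
            for k in range(n_labels)
        )
        if score >= threshold:
            selected.append(j)
    return selected


# Toy data (illustrative only): 4 instances, 3 binary features, 2 binary labels.
X = [[0, 1, 0], [0, 0, 0], [1, 1, 0], [1, 0, 0]]
Y = [[0, 1], [0, 0], [1, 1], [1, 0]]
selected = select_features(X, Y, threshold=0.5)
print(selected)
```

In this toy data the third feature is constant, so it carries no information about either label and falls below any positive threshold. Because the score uses only the data and never trains a model, a filter of this kind stays independent of the classifier.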

We validate EMFSIE on four datasets from two domains using four different multi-label classifiers.

The experimental results and their analysis show preliminarily that EMFSIE can not only remove more than 70% of the original features, which makes the classifiers faster, but also keep the prediction performance of the classifiers as good as before, and even improve it on three datasets under two-tailed paired t-tests at the 0.05 significance level.
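For readers unfamiliar with the test named above, here is a minimal sketch of a two-tailed paired t-test using only the Python standard library. The scores are illustrative numbers, not results from the paper, and `paired_t_statistic` is a hypothetical helper; the critical value 2.262 is the standard two-tailed value for a 0.05 significance level with 9 degrees of freedom.

```python
import math
import statistics


def paired_t_statistic(before, after):
    """t statistic of the paired differences (after - before)."""
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    return mean_diff / (sd_diff / math.sqrt(n))


# Illustrative scores only, not results from the paper:
# one classifier evaluated on 10 paired runs, without and with feature selection.
before = [0.70, 0.72, 0.68, 0.71, 0.69, 0.73, 0.70, 0.72, 0.71, 0.69]
after = [0.73, 0.74, 0.70, 0.74, 0.70, 0.76, 0.72, 0.75, 0.73, 0.72]

t = paired_t_statistic(before, after)
T_CRITICAL = 2.262  # two-tailed critical value for alpha = 0.05, df = 9
significant = abs(t) > T_CRITICAL
print(f"t = {t:.3f}, significant at 0.05: {significant}")
```

The test is paired because each run is measured twice under matched conditions, so only the per-run differences enter the statistic.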

American Psychological Association (APA) citation style

Li, Shining & Zhang, Zhenhai & Duan, Jiaqi. 2014. An ensemble multi-label feature selection algorithm based on information entropy. The International Arab Journal of Information Technology, Vol. 11, no. 4.
https://search.emarefa.net/detail/BIM-334388

Modern Language Association (MLA) citation style

Li, Shining…[et al.]. An ensemble multi-label feature selection algorithm based on information entropy. The International Arab Journal of Information Technology Vol. 11, no. 4 (Jul. 2014).
https://search.emarefa.net/detail/BIM-334388

American Medical Association (AMA) citation style

Li, Shining & Zhang, Zhenhai & Duan, Jiaqi. An ensemble multi-label feature selection algorithm based on information entropy. The International Arab Journal of Information Technology. 2014. Vol. 11, no. 4.
https://search.emarefa.net/detail/BIM-334388

Data Type

Articles

Text Language

English

Notes

Includes appendix.

Record ID

BIM-334388