Assesing the stability and selection performance of feature selection methods under different data complexity

Joint Authors

al-Husni, Umaymah
Starkey, Andrew

Source

The International Arab Journal of Information Technology

Issue

Vol. 19, Issue 3A (s) (31 May. 2022), pp.442-455, 14 p.

Publisher

Zarqa University Deanship of Scientific Research

Publication Date

2022-05-31

Country of Publication

Jordan

No. of Pages

14

Main Subjects

Information Technology and Computer Science

Abstract EN

Our study aims to investigate the stability and the selection accuracy of feature selection performance under different data complexity.

The motivation behind this investigation is that there are significant contributions in the research community from examining the effect of complex data characteristics such as overlapping classes or non-linearity of the decision boundaries on the classification algorithm's performance; however, relatively few studies have investigated the stability and the selection accuracy of feature selection methods with such data characteristics.

Also, this study is interested in investigating the interactive effects of the classes overlapped with other data challenges such as small sample size, high dimensionality associated with irrelevant features, and imbalance classes to provide meaningful insights into the root causes for feature selection methods misdiagnosing the relevant features among different real-world data challenges.

This analysis will be extended to real-world data to guide the practitioners and researchers in choosing the correct feature selection methods that are more appropriate for a particular dataset.

Our study outcomes indicate that using feature selection techniques with datasets of different characteristics may generate different subsets of features under variations to the training data showing that small sample size and overlapping classes have the highest impact on the stability and selection accuracy of feature selection performance, among other data challenges that have been investigated in this study.

Also, in this study, we will provide a survey on the current state of research in the feature selection stability context to highlight the area that requires more attention for other researchers.

American Psychological Association (APA)

al-Husni, Umaymah& Starkey, Andrew. 2022. Assesing the stability and selection performance of feature selection methods under different data complexity. The International Arab Journal of Information Technology،Vol. 19, no. 3A (s), pp.442-455.
https://search.emarefa.net/detail/BIM-1437109

Modern Language Association (MLA)

al-Husni, Umaymah& Starkey, Andrew. Assesing the stability and selection performance of feature selection methods under different data complexity. The International Arab Journal of Information Technology Vol. 19, no. 3A (Special issue) (2022), pp.442-455.
https://search.emarefa.net/detail/BIM-1437109

American Medical Association (AMA)

al-Husni, Umaymah& Starkey, Andrew. Assesing the stability and selection performance of feature selection methods under different data complexity. The International Arab Journal of Information Technology. 2022. Vol. 19, no. 3A (s), pp.442-455.
https://search.emarefa.net/detail/BIM-1437109

Data Type

Journal Articles

Language

English

Record ID

BIM-1437109