Effects of Pooling Samples on the Performance of Classification Algorithms : A Comparative Study

Joint Authors

Dehmer, Matthias
Kusonmano, Kanthida
Graber, Armin
Liedl, Klaus R.
Baumgartner, Christian
Netzer, Michael

Source

The Scientific World Journal

Issue

Vol. 2012, Issue 2012 (31 Dec. 2012), pp.1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2012-04-30

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Natural & Life Sciences (Multidisciplinary)
Medicine
Information Technology and Computer Science

Abstract EN

A pooling design can be used as a powerful strategy to compensate for limited amounts of samples or high biological variation.

In this paper, we perform a comparative study to model and quantify the effects of virtual pooling on the performance of the widely applied classifiers, support vector machines (SVMs), random forest (RF), k-nearest neighbors (k-NN), penalized logistic regression (PLR), and prediction analysis for microarrays (PAMs).

We evaluate a variety of experimental designs using mock omics datasets with varying levels of pool sizes and considering effects from feature selection.

Our results show that feature selection significantly improves classifier performance for non-pooled and pooled data.

All investigated classifiers yield lower misclassification rates with smaller pool sizes.

RF mainly outperforms other investigated algorithms, while accuracy levels are comparable among all the remaining ones.

Guidelines are derived to identify an optimal pooling scheme for obtaining adequate predictive power and, hence, to motivate a study design that meets best experimental objectives and budgetary conditions, including time constraints.

American Psychological Association (APA)

Kusonmano, Kanthida& Netzer, Michael& Baumgartner, Christian& Dehmer, Matthias& Liedl, Klaus R.& Graber, Armin. 2012. Effects of Pooling Samples on the Performance of Classification Algorithms : A Comparative Study. The Scientific World Journal،Vol. 2012, no. 2012, pp.1-10.
https://search.emarefa.net/detail/BIM-459729

Modern Language Association (MLA)

Kusonmano, Kanthida…[et al.]. Effects of Pooling Samples on the Performance of Classification Algorithms : A Comparative Study. The Scientific World Journal No. 2012 (2012), pp.1-10.
https://search.emarefa.net/detail/BIM-459729

American Medical Association (AMA)

Kusonmano, Kanthida& Netzer, Michael& Baumgartner, Christian& Dehmer, Matthias& Liedl, Klaus R.& Graber, Armin. Effects of Pooling Samples on the Performance of Classification Algorithms : A Comparative Study. The Scientific World Journal. 2012. Vol. 2012, no. 2012, pp.1-10.
https://search.emarefa.net/detail/BIM-459729

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-459729