Determining Cutoff Point of Ensemble Trees Based on Sample Size in Predicting Clinical Dose with DNA Microarray Data

Joint Authors

Karabulut, Erdem
Yılmaz Isıkhan, Selen
Alpar, Celal Reha

Source

Computational and Mathematical Methods in Medicine

Issue

Vol. 2016, Issue 2016 (31 Dec. 2016), pp.1-9, 9 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2016-12-20

Country of Publication

Egypt

No. of Pages

9

Main Subjects

Medicine

Abstract EN

Background/Aim.

Evaluating the success of dose prediction based on genetic or clinical data has substantially advanced recently.

The aim of this study is to predict various clinical dose values from DNA gene expression datasets using data mining techniques.

Materials and Methods.

Eleven real gene expression datasets containing dose values were included.

First, important genes for dose prediction were selected using iterative sure independence screening.

Then, the performances of regression trees (RTs), support vector regression (SVR), RT bagging, SVR bagging, and RT boosting were examined.

Results.

The results demonstrated that a regression-based feature selection method substantially reduced the number of irrelevant genes from raw datasets.

Overall, the best prediction performance in nine of 11 datasets was achieved using SVR; the second most accurate performance was provided using a gradient-boosting machine (GBM).

Conclusion.

Analysis of various dose values based on microarray gene expression data identified common genes found in our study and the referenced studies.

According to our findings, SVR and GBM can be good predictors of dose-gene datasets.

Another result of the study was to identify the sample size of n=25 as a cutoff point for RT bagging to outperform a single RT.

American Psychological Association (APA)

Yılmaz Isıkhan, Selen& Karabulut, Erdem& Alpar, Celal Reha. 2016. Determining Cutoff Point of Ensemble Trees Based on Sample Size in Predicting Clinical Dose with DNA Microarray Data. Computational and Mathematical Methods in Medicine،Vol. 2016, no. 2016, pp.1-9.
https://search.emarefa.net/detail/BIM-1100174

Modern Language Association (MLA)

Yılmaz Isıkhan, Selen…[et al.]. Determining Cutoff Point of Ensemble Trees Based on Sample Size in Predicting Clinical Dose with DNA Microarray Data. Computational and Mathematical Methods in Medicine No. 2016 (2016), pp.1-9.
https://search.emarefa.net/detail/BIM-1100174

American Medical Association (AMA)

Yılmaz Isıkhan, Selen& Karabulut, Erdem& Alpar, Celal Reha. Determining Cutoff Point of Ensemble Trees Based on Sample Size in Predicting Clinical Dose with DNA Microarray Data. Computational and Mathematical Methods in Medicine. 2016. Vol. 2016, no. 2016, pp.1-9.
https://search.emarefa.net/detail/BIM-1100174

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1100174