Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection

Joint Authors

Guo, Jing
Sun, Xiao
Ma, Xin

Source

BioMed Research International

Issue

Vol. 2015, Issue 2015 (31 Dec. 2015), pp.1-10, 10 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2015-10-12

Country of Publication

Egypt

No. of Pages

10

Main Subjects

Medicine

Abstract EN

The prediction of RNA-binding proteins is one of the most challenging problems in computation biology.

Although some studies have investigated this problem, the accuracy of prediction is still not sufficient.

In this study, a highly accurate method was developed to predict RNA-binding proteins from amino acid sequences using random forests with the minimum redundancy maximum relevance (mRMR) method, followed by incremental feature selection (IFS).

We incorporated features of conjoint triad features and three novel features: binding propensity (BP), nonbinding propensity (NBP), and evolutionary information combined with physicochemical properties (EIPP).

The results showed that these novel features have important roles in improving the performance of the predictor.

Using the mRMR-IFS method, our predictor achieved the best performance (86.62% accuracy and 0.737 Matthews correlation coefficient).

High prediction accuracy and successful prediction performance suggested that our method can be a useful approach to identify RNA-binding proteins from sequence information.

American Psychological Association (APA)

Ma, Xin& Guo, Jing& Sun, Xiao. 2015. Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection. BioMed Research International،Vol. 2015, no. 2015, pp.1-10.
https://search.emarefa.net/detail/BIM-1055441

Modern Language Association (MLA)

Ma, Xin…[et al.]. Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection. BioMed Research International No. 2015 (2015), pp.1-10.
https://search.emarefa.net/detail/BIM-1055441

American Medical Association (AMA)

Ma, Xin& Guo, Jing& Sun, Xiao. Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection. BioMed Research International. 2015. Vol. 2015, no. 2015, pp.1-10.
https://search.emarefa.net/detail/BIM-1055441

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1055441