A Novel Method of Predicting Protein Disordered Regions Based on Sequence Features

المؤلفون المشاركون

Zhao, Tong-Hui
Jiang, Min
Huang, Tao
Li, Bi-Qing
Cai, Yu-Dong
Li, Hai-Peng
Zhang, Ning

المصدر

BioMed Research International

العدد

المجلد 2013، العدد 2013 (31 ديسمبر/كانون الأول 2013)، ص ص. 1-8، 8ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2013-04-22

دولة النشر

مصر

عدد الصفحات

8

التخصصات الرئيسية

الطب البشري

الملخص EN

With a large number of disordered proteins and their important functions discovered, it is highly desired to develop effective methods to computationally predict protein disordered regions.

In this study, based on Random Forest (RF), Maximum Relevancy Minimum Redundancy (mRMR), and Incremental Feature Selection (IFS), we developed a new method to predict disordered regions in proteins.

The mRMR criterion was used to rank the importance of all candidate features.

Finally, top 128 features were selected from the ranked feature list to build the optimal model, including 92 Position Specific Scoring Matrix (PSSM) conservation score features and 36 secondary structure features.

As a result, Matthews correlation coefficient (MCC) of 0.3895 was achieved on the training set by 10-fold cross-validation.

On the basis of predicting results for each query sequence by using the method, we used the scanning and modification strategy to improve the performance.

The accuracy (ACC) and MCC were increased by 4% and almost 0.2%, respectively, compared with other three popular predictors: DISOPRED, DISOclust, and OnD-CRF.

The selected features may shed some light on the understanding of the formation mechanism of disordered structures, providing guidelines for experimental validation.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Zhao, Tong-Hui& Jiang, Min& Huang, Tao& Li, Bi-Qing& Zhang, Ning& Li, Hai-Peng…[et al.]. 2013. A Novel Method of Predicting Protein Disordered Regions Based on Sequence Features. BioMed Research International،Vol. 2013, no. 2013, pp.1-8.
https://search.emarefa.net/detail/BIM-1004260

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Zhao, Tong-Hui…[et al.]. A Novel Method of Predicting Protein Disordered Regions Based on Sequence Features. BioMed Research International No. 2013 (2013), pp.1-8.
https://search.emarefa.net/detail/BIM-1004260

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Zhao, Tong-Hui& Jiang, Min& Huang, Tao& Li, Bi-Qing& Zhang, Ning& Li, Hai-Peng…[et al.]. A Novel Method of Predicting Protein Disordered Regions Based on Sequence Features. BioMed Research International. 2013. Vol. 2013, no. 2013, pp.1-8.
https://search.emarefa.net/detail/BIM-1004260

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1004260