PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method

Joint Authors

Wang, Jun
Zheng, Huiwen
Yang, Yang
Xiao, Wanyue
Liu, Taigang

Source

BioMed Research International

Issue

Vol. 2020, no. 2020 (31 December 2020), pp. 1-8, 8 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-04-13

Country of Publication

Egypt

No. of Pages

8

Main Subjects

Medicine

Abstract EN

DNA-binding proteins (DBPs) play vital roles in all aspects of genetic activities.

However, the identification of DBPs by using wet-lab experimental approaches is often time-consuming and laborious.

In this study, we develop a novel computational method, called PredDBP-Stack, to predict DBPs solely based on protein sequences.

First, amino acid composition (AAC) and transition probability composition (TPC) extracted from the hidden Markov model (HMM) profile are adopted to represent a protein.
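As a rough illustration of the two feature types named above, the sketch below computes AAC and TPC from a profile matrix. It assumes the HMM profile has already been parsed into an L x 20 array of non-negative scores (one row per residue, one column per amino acid). It also assumes formulations that are common for profile-based features: AAC as the column-wise mean, and TPC as the row-normalized sum of products between adjacent positions. The function names `aac`, `tpc`, and `hmm_features` are hypothetical, and the exact definitions used by PredDBP-Stack may differ.

```python
import numpy as np


def aac(profile):
    """Amino acid composition: mean of each of the 20 profile columns."""
    H = np.asarray(profile, dtype=float)
    return H.mean(axis=0)


def tpc(profile):
    """Transition probability composition (assumed formulation).

    pair[m, n] = sum_i H[i, m] * H[i + 1, n], then each row m is
    normalized so that its 20 entries sum to 1.
    """
    H = np.asarray(profile, dtype=float)
    pair = H[:-1].T @ H[1:]                      # shape (20, 20)
    row_sums = pair.sum(axis=1, keepdims=True)
    safe = np.where(row_sums == 0, 1.0, row_sums)  # avoid division by zero
    return (pair / safe).ravel()                 # 400 values


def hmm_features(profile):
    """Concatenate AAC (20) and TPC (400) into one 420-dimensional vector."""
    return np.concatenate([aac(profile), tpc(profile)])


# Toy profile standing in for a parsed HMM profile of a 50-residue protein.
rng = np.random.default_rng(0)
toy_profile = rng.random((50, 20))
features = hmm_features(toy_profile)
```

Under these assumptions, every protein maps to a fixed-length vector regardless of sequence length, which is what lets standard classifiers consume it.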

Next, we establish a stacked ensemble model to identify DBPs, which involves two stages of learning.

In the first stage, the four base classifiers are trained with the features of HMM-based compositions.

In the second stage, the prediction probabilities of these base classifiers are used as inputs to the meta-classifier to perform the final prediction of DBPs.
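The two-stage scheme described above can be sketched with scikit-learn's `StackingClassifier`. The abstract does not name the four base classifiers or the meta-classifier, so the choices below (k-nearest neighbours, Gaussian naive Bayes, logistic regression, and random forest as base learners, with logistic regression as the meta-classifier) are placeholders only, and the synthetic data stands in for real HMM-based features.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the feature matrix (assumption: binary labels,
# 1 = DNA-binding, 0 = non-binding).
X, y = make_classification(n_samples=200, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Stage 1: four base classifiers (placeholder choices, not the paper's).
base_learners = [
    ("knn", KNeighborsClassifier(n_neighbors=5)),
    ("nb", GaussianNB()),
    ("lr", LogisticRegression(max_iter=1000)),
    ("rf", RandomForestClassifier(n_estimators=50, random_state=0)),
]

# Stage 2: the meta-classifier is trained on the base classifiers'
# out-of-fold predicted probabilities.
stack = StackingClassifier(
    estimators=base_learners,
    final_estimator=LogisticRegression(max_iter=1000),
    stack_method="predict_proba",
    cv=5,
)
stack.fit(X_train, y_train)
proba = stack.predict_proba(X_test)
```

Setting `stack_method="predict_proba"` mirrors the abstract's statement that prediction probabilities, rather than hard labels, feed the second stage. The internal `cv=5` keeps the meta-classifier from being trained on predictions the base models made for their own training samples.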

On the PDB1075 benchmark dataset, we conduct a jackknife cross-validation with the proposed PredDBP-Stack predictor and obtain a balanced sensitivity and specificity of 92.47% and 92.36%, respectively.
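For readers unfamiliar with the evaluation protocol, the following minimal sketch shows how a jackknife (leave-one-out) test and the two reported metrics could be computed. The nearest-centroid classifier and the six-sample toy data are hypothetical stand-ins chosen only to keep the example self-contained. They are not the paper's model or data.

```python
import numpy as np


def jackknife_predictions(X, y, fit_predict):
    """Hold out each sample once, train on the rest, predict the held-out one."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    preds = np.empty_like(y)
    for i in range(len(y)):
        keep = np.arange(len(y)) != i
        preds[i] = fit_predict(X[keep], y[keep], X[i:i + 1])[0]
    return preds


def sensitivity_specificity(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    tp = np.sum((y_true == 1) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    tn = np.sum((y_true == 0) & (y_pred == 0))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    return tp / (tp + fn), tn / (tn + fp)


def nearest_centroid(X_train, y_train, X_test):
    """Toy stand-in classifier: assign the label of the closer class mean."""
    c0 = X_train[y_train == 0].mean(axis=0)
    c1 = X_train[y_train == 1].mean(axis=0)
    d0 = np.linalg.norm(X_test - c0, axis=1)
    d1 = np.linalg.norm(X_test - c1, axis=1)
    return (d1 < d0).astype(int)


# Toy data: three negatives near 0 and three positives near 10.
X_toy = [[0.0], [1.0], [2.0], [10.0], [11.0], [12.0]]
y_toy = [0, 0, 0, 1, 1, 1]
preds = jackknife_predictions(X_toy, y_toy, nearest_centroid)
sens, spec = sensitivity_specificity(y_toy, preds)
```

Because the jackknife retrains the model once per sample, it is deterministic for a given dataset, but its cost grows linearly with the number of samples.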

This result exceeds that of most existing classifiers.

Furthermore, our method also achieves superior performance and model robustness on the PDB186 independent dataset.

This demonstrates that PredDBP-Stack is an effective classifier for accurately identifying DBPs based on protein sequence information alone.

American Psychological Association (APA) Citation Style

Wang, Jun & Zheng, Huiwen & Yang, Yang & Xiao, Wanyue & Liu, Taigang. 2020. PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method. BioMed Research International, Vol. 2020, no. 2020, pp. 1-8.
https://search.emarefa.net/detail/BIM-1136741

Modern Language Association (MLA) Citation Style

Wang, Jun…[et al.]. PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method. BioMed Research International No. 2020 (2020), pp. 1-8.
https://search.emarefa.net/detail/BIM-1136741

American Medical Association (AMA) Citation Style

Wang, Jun & Zheng, Huiwen & Yang, Yang & Xiao, Wanyue & Liu, Taigang. PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method. BioMed Research International. 2020. Vol. 2020, no. 2020, pp. 1-8.
https://search.emarefa.net/detail/BIM-1136741

Data Type

Articles

Text Language

English

Notes

Includes bibliographical references

Record ID

BIM-1136741