PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method

Joint Authors

Wang, Jun
Zheng, Huiwen
Yang, Yang
Xiao, Wanyue
Liu, Taigang

Source

BioMed Research International

Issue

Vol. 2020, Issue 2020 (31 Dec. 2020), pp.1-8, 8 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2020-04-13

Country of Publication

Egypt

No. of Pages

8

Main Subjects

Medicine

Abstract EN

DNA-binding proteins (DBPs) play vital roles in all aspects of genetic activities.

However, the identification of DBPs by using wet-lab experimental approaches is often time-consuming and laborious.

In this study, we develop a novel computational method, called PredDBP-Stack, to predict DBPs solely based on protein sequences.

First, amino acid composition (AAC) and transition probability composition (TPC) extracted from the hidden Markov model (HMM) profile are adopted to represent a protein.
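
The two compositions named above can be sketched as follows. This is a minimal illustration, assuming the HMM profile has already been parsed into an L x 20 matrix of per-position amino acid probabilities. The function names (`aac`, `tpc`, `hmm_features`) and the row normalization used for TPC are assumptions made for this example; the paper's exact definitions may differ.

```python
import random

def aac(profile):
    """AAC: average each of the 20 profile columns over all positions."""
    length = len(profile)
    return [sum(row[j] for row in profile) / length for j in range(20)]

def tpc(profile):
    """TPC (assumed form): transitions between consecutive positions.

    counts[i][j] = sum over k of profile[k][i] * profile[k + 1][j];
    each row i is then divided by its total so that it sums to 1.
    """
    counts = [[0.0] * 20 for _ in range(20)]
    for cur, nxt in zip(profile, profile[1:]):
        for i in range(20):
            for j in range(20):
                counts[i][j] += cur[i] * nxt[j]
    features = []
    for row in counts:
        total = sum(row)
        features.extend(v / total if total > 0 else 0.0 for v in row)
    return features

def hmm_features(profile):
    """Concatenate AAC (20 values) and TPC (400 values)."""
    return aac(profile) + tpc(profile)

# Toy profile: 30 positions, each row normalized to sum to 1.
# Real profiles would come from an HMM search tool, not random numbers.
rng = random.Random(0)
toy_profile = []
for _ in range(30):
    raw = [rng.random() for _ in range(20)]
    row_total = sum(raw)
    toy_profile.append([v / row_total for v in raw])

features = hmm_features(toy_profile)
print(len(features))  # 420
```

Under these assumptions every protein, whatever its length, maps to a fixed-length vector of 420 values, which is what lets a standard classifier consume it.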

Next, we establish a stacked ensemble model to identify DBPs, which involves two stages of learning.

In the first stage, the four base classifiers are trained with the features of HMM-based compositions.

In the second stage, the prediction probabilities of these base classifiers are used as inputs to the meta-classifier to perform the final prediction of DBPs.
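
The two-stage scheme described above can be illustrated with a small, dependency-free sketch. The abstract does not name the four base classifiers or the meta-classifier, so the toy nearest-centroid learners and the logistic meta-learner below are stand-ins chosen only for illustration. Feeding the meta-classifier out-of-fold probabilities is also an assumption (a common stacking practice), not something the abstract states.

```python
import math
import random

def fit_centroid(X, y, cols):
    """Toy base learner: class centroids on a subset of feature columns."""
    cents = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        cents[label] = [sum(r[c] for r in rows) / len(rows) for c in cols]

    def predict_proba(x):
        point = [x[c] for c in cols]
        d0 = math.dist(point, cents[0])
        d1 = math.dist(point, cents[1])
        # Closer to the class-1 centroid gives a score nearer to 1.
        return d0 / (d0 + d1) if d0 + d1 > 0 else 0.5
    return predict_proba

def fit_logistic(Z, y, lr=0.5, epochs=500):
    """Toy meta-learner: logistic regression by batch gradient descent."""
    w, b, n = [0.0] * len(Z[0]), 0.0, len(Z)

    def predict_proba(z):
        return 1.0 / (1.0 + math.exp(-(sum(wi * zi for wi, zi in zip(w, z)) + b)))

    for _ in range(epochs):
        errs = [predict_proba(z) - t for z, t in zip(Z, y)]
        grad_w = [sum(e * z[j] for e, z in zip(errs, Z)) / n for j in range(len(w))]
        grad_b = sum(errs) / n
        w[:] = [wi - lr * g for wi, g in zip(w, grad_w)]
        b -= lr * grad_b
    return predict_proba

def fit_stack(X, y, feature_subsets, k=5):
    n = len(X)
    # Stage 1: out-of-fold probabilities from each base learner.
    Z = [[0.0] * len(feature_subsets) for _ in range(n)]
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        held = [i for i in range(n) if i % k == fold]
        X_tr, y_tr = [X[i] for i in train], [y[i] for i in train]
        for j, cols in enumerate(feature_subsets):
            model = fit_centroid(X_tr, y_tr, cols)
            for i in held:
                Z[i][j] = model(X[i])
    # Stage 2: the meta-learner is trained on the base probabilities.
    meta = fit_logistic(Z, y)
    bases = [fit_centroid(X, y, cols) for cols in feature_subsets]
    return lambda x: meta([base(x) for base in bases])

# Synthetic two-class data with 4 features (not protein data).
rng = random.Random(0)

def make_data(n):
    labels = [i % 2 for i in range(n)]
    return [[rng.gauss(2.0 * t, 1.0) for _ in range(4)] for t in labels], labels

X_train, y_train = make_data(60)
X_test, y_test = make_data(40)
stack = fit_stack(X_train, y_train, feature_subsets=[[0], [1], [2], [3]])
probs = [stack(x) for x in X_test]
accuracy = sum((p >= 0.5) == (t == 1) for p, t in zip(probs, y_test)) / len(y_test)
```

In this sketch the meta-learner never sees a base prediction made on a sample the base learner was trained on, which keeps the second stage from simply inheriting first-stage overfitting.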

On the PDB1075 benchmark dataset, we conduct a jackknife cross-validation with the proposed PredDBP-Stack predictor and obtain a balanced sensitivity and specificity of 92.47% and 92.36%, respectively.
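
For readers unfamiliar with the evaluation protocol, the jackknife test is leave-one-out validation: each sample is predicted by a model trained on all the others. Sensitivity is TP / (TP + FN) and specificity is TN / (TN + FP). The sketch below shows the procedure on eight made-up one-dimensional samples with a toy 1-nearest-neighbor classifier; the data, the classifier, and the function names are illustrative assumptions and have no connection to PDB1075.

```python
def fit_nearest_neighbor(X_train, y_train):
    """Toy classifier: return the label of the closest training value."""
    def predict(x):
        nearest = min(zip(X_train, y_train), key=lambda pair: abs(pair[0] - x))
        return nearest[1]
    return predict

def jackknife_predict(X, y, fit):
    """Leave-one-out: predict each sample with a model trained on the rest."""
    preds = []
    for i in range(len(X)):
        model = fit(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        preds.append(model(X[i]))
    return preds

def sensitivity_specificity(y_true, y_pred):
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

# Made-up data: label 1 stands for "binding", label 0 for "non-binding".
X = [0, 10, 25, 90, 100, 115, 78, 35]
y = [0, 0, 0, 1, 1, 1, 0, 1]

preds = jackknife_predict(X, y, fit_nearest_neighbor)
sens, spec = sensitivity_specificity(y, preds)
print(sens, spec)  # 0.75 0.5
```

Reporting both numbers matters because a predictor can score high on one by sacrificing the other; values that are close to each other, as in the result quoted above, indicate that neither class is favored.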

This result surpasses those of most existing classifiers.

Furthermore, our method achieves superior performance and model robustness on the PDB186 independent dataset.

This demonstrates that PredDBP-Stack is an effective classifier for accurately identifying DBPs based on protein sequence information alone.

American Psychological Association (APA)

Wang, Jun & Zheng, Huiwen & Yang, Yang & Xiao, Wanyue & Liu, Taigang. 2020. PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method. BioMed Research International, Vol. 2020, no. 2020, pp. 1-8.
https://search.emarefa.net/detail/BIM-1136741

Modern Language Association (MLA)

Wang, Jun…[et al.]. PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method. BioMed Research International No. 2020 (2020), pp.1-8.
https://search.emarefa.net/detail/BIM-1136741

American Medical Association (AMA)

Wang, Jun & Zheng, Huiwen & Yang, Yang & Xiao, Wanyue & Liu, Taigang. PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method. BioMed Research International. 2020. Vol. 2020, no. 2020, pp. 1-8.
https://search.emarefa.net/detail/BIM-1136741

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1136741