Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information
Joint Authors
Wu, Jiansheng
Ma, Xin
Xue, Xiaoyun
Source
Computational and Mathematical Methods in Medicine
Issue
Vol. 2013, Issue 2013 (31 Dec. 2013), pp.1-8, 8 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2013-09-16
Country of Publication
Egypt
No. of Pages
8
Main Subjects
Abstract EN
DNA-binding proteins are fundamentally important in understanding cellular processes.
Thus, the identification of DNA-binding proteins has the particularly important practical application in various fields, such as drug design.
We have proposed a novel approach method for predicting DNA-binding proteins using only sequence information.
The prediction model developed in this study is constructed by support vector machine-sequential minimal optimization (SVM-SMO) algorithm in conjunction with a hybrid feature.
The hybrid feature is incorporating evolutionary information feature, physicochemical property feature, and two novel attributes.
These two attributes use DNA-binding residues and nonbinding residues in a query protein to obtain DNA-binding propensity and nonbinding propensity.
The results demonstrate that our SVM-SMO model achieves 0.67 Matthew's correlation coefficient (MCC) and 89.6% overall accuracy with 88.4% sensitivity and 90.8% specificity, respectively.
Performance comparisons on various features indicate that two novel attributes contribute to the performance improvement.
In addition, our SVM-SMO model achieves the best performance than state-of-the-art methods on independent test dataset.
American Psychological Association (APA)
Ma, Xin& Wu, Jiansheng& Xue, Xiaoyun. 2013. Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information. Computational and Mathematical Methods in Medicine،Vol. 2013, no. 2013, pp.1-8.
https://search.emarefa.net/detail/BIM-478530
Modern Language Association (MLA)
Ma, Xin…[et al.]. Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information. Computational and Mathematical Methods in Medicine No. 2013 (2013), pp.1-8.
https://search.emarefa.net/detail/BIM-478530
American Medical Association (AMA)
Ma, Xin& Wu, Jiansheng& Xue, Xiaoyun. Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information. Computational and Mathematical Methods in Medicine. 2013. Vol. 2013, no. 2013, pp.1-8.
https://search.emarefa.net/detail/BIM-478530
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-478530