Classification of Complete Proteomes of Different Organisms and Protein Sets Based on Their Protein Distributions in Terms of Some Key Attributes of Proteins
Joint Authors
Guo, Hong
Yang, Xiaohan
Guo, Hao-Bo
Ma, Yue
Tuskan, Gerald A.
Source
International Journal of Genomics
Issue
Vol. 2018, Issue 2018 (31 Dec. 2018), pp.1-12, 12 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2018-03-04
Country of Publication
Egypt
No. of Pages
12
Main Subjects
Abstract EN
The availability of complete genome sequences makes it important to develop different approaches for classifying large-scale data sets and for making the extraction of biological insights easier.
Here, we propose an approach for classification of complete proteomes/protein sets based on protein distributions on some basic attributes.
We demonstrate the usefulness of this approach by determining protein distributions in terms of two attributes: protein length (L) and protein intrinsic disorder content (ID).
The protein distributions based on L and ID are surveyed for the proteomes of representative organisms and for protein sets from the three domains of life.
Two-dimensional maps (designated here as fingerprints) are then constructed from the protein distribution densities in the LD space defined by ln(L) and ID.
The fingerprints for different organisms and protein sets are found to be distinct from one another, and they can therefore be used for comparative studies.
As a test case, phylogenetic trees have been constructed based on the protein distribution densities in the fingerprints of proteomes of organisms, without performing any protein sequence comparison or alignment.
The phylogenetic trees generated are biologically meaningful, demonstrating that the protein distributions in the LD space may serve as unique phylogenetic signals of the organisms at the proteome level.
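The fingerprint construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the bin counts, the ln(L) range, the representation of ID as a fraction between 0 and 1, and the use of Euclidean distance between fingerprints are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def fingerprint(lengths, disorder, bins=(20, 20), ln_len_range=(3.0, 9.0)):
    """Normalized 2D density of proteins over (ln L, ID).

    Assumptions (not from the abstract): ID is a fraction in [0, 1],
    20 x 20 bins, and ln(L) limited to [3, 9]. Proteins falling outside
    these ranges are ignored by the histogram.
    """
    ln_len = np.log(np.asarray(lengths, dtype=float))
    id_frac = np.asarray(disorder, dtype=float)
    hist, _, _ = np.histogram2d(
        ln_len, id_frac, bins=bins, range=[ln_len_range, (0.0, 1.0)]
    )
    total = hist.sum()
    # Normalize so proteomes of different sizes are comparable.
    return hist / total if total > 0 else hist

def fingerprint_distance(fp_a, fp_b):
    """Euclidean distance between two fingerprints (an assumed metric)."""
    return float(np.sqrt(((fp_a - fp_b) ** 2).sum()))

def distance_matrix(fingerprints):
    """Symmetric pairwise distance matrix for a list of fingerprints."""
    n = len(fingerprints)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            d = fingerprint_distance(fingerprints[i], fingerprints[j])
            dist[i, j] = dist[j, i] = d
    return dist

# Toy input only: made-up lengths and disorder fractions, not real data.
fp_one = fingerprint([100, 200, 400, 800], [0.0, 0.1, 0.5, 1.0])
fp_two = fingerprint([100, 100], [0.9, 0.9])
dist = distance_matrix([fp_one, fp_two])
```

A distance matrix of this kind could then be passed to a standard tree-building or hierarchical clustering routine, which is one way to obtain a tree without any sequence alignment.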
American Psychological Association (APA)
Guo, Hao-Bo & Ma, Yue & Tuskan, Gerald A. & Yang, Xiaohan & Guo, Hong. 2018. Classification of Complete Proteomes of Different Organisms and Protein Sets Based on Their Protein Distributions in Terms of Some Key Attributes of Proteins. International Journal of Genomics, Vol. 2018, no. 2018, pp.1-12.
https://search.emarefa.net/detail/BIM-1172938
Modern Language Association (MLA)
Guo, Hao-Bo…[et al.]. Classification of Complete Proteomes of Different Organisms and Protein Sets Based on Their Protein Distributions in Terms of Some Key Attributes of Proteins. International Journal of Genomics No. 2018 (2018), pp.1-12.
https://search.emarefa.net/detail/BIM-1172938
American Medical Association (AMA)
Guo, Hao-Bo & Ma, Yue & Tuskan, Gerald A. & Yang, Xiaohan & Guo, Hong. Classification of Complete Proteomes of Different Organisms and Protein Sets Based on Their Protein Distributions in Terms of Some Key Attributes of Proteins. International Journal of Genomics. 2018. Vol. 2018, no. 2018, pp.1-12.
https://search.emarefa.net/detail/BIM-1172938
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-1172938