Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

Joint Authors

Xie, Lu
Ling, Zongxin
Wang, Yin
Li, Rudong
Zhou, Yuhua
Guo, Xiaokui
Liu, Lei

Source

BioMed Research International

Issue

Vol. 2016, Issue 2016 (31 Dec. 2016), pp.1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2016-02-14

Country of Publication

Egypt

No. of Pages

11

Main Subjects

Medicine

Abstract EN

Background.

Text data of 16S rRNA are informative for classifications of microbiota-associated diseases.

However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty.

Results.

Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data.

By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets.

Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods.

Conclusions.

We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries.

The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

American Psychological Association (APA)

Wang, Yin& Li, Rudong& Zhou, Yuhua& Ling, Zongxin& Guo, Xiaokui& Xie, Lu…[et al.]. 2016. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification. BioMed Research International،Vol. 2016, no. 2016, pp.1-11.
https://search.emarefa.net/detail/BIM-1098522

Modern Language Association (MLA)

Wang, Yin…[et al.]. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification. BioMed Research International No. 2016 (2016), pp.1-11.
https://search.emarefa.net/detail/BIM-1098522

American Medical Association (AMA)

Wang, Yin& Li, Rudong& Zhou, Yuhua& Ling, Zongxin& Guo, Xiaokui& Xie, Lu…[et al.]. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification. BioMed Research International. 2016. Vol. 2016, no. 2016, pp.1-11.
https://search.emarefa.net/detail/BIM-1098522

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1098522