Discovering the Unknown: Improving Detection of Novel Species and Genera from Short Reads

Co-authors

Essinger, Steven D.
Rosen, Gail L.
Polikar, Robi
Caseiro, Diamantino A.
Sokhansanj, Bahrad A.

Source

BioMed Research International

Issue

Vol. 2011, no. 2011 (31 December 2011), pp. 1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication date

2011-02-17

Country of publication

Egypt

Number of pages

11

Main subjects

Medicine

Abstract

High-throughput sequencing technologies enable metagenome profiling, the simultaneous sequencing of multiple microbial species present within an environmental sample.

Since metagenomic data includes sequence fragments (“reads”) from organisms that are absent from any database, new algorithms must be developed for the identification and annotation of novel sequence fragments.

Homology-based techniques have been modified to detect novel species and genera, but composition-based methods have not been adapted.

We develop a detection technique that can discriminate between “known” and “unknown” taxa, which can be used with composition-based methods, as well as a hybrid method.

Unlike previous studies, we rigorously evaluate all algorithms for their ability to detect novel taxa.

First, we show that the integration of a detector with a composition-based method performs significantly better than homology-based methods for the detection of novel species and genera, with best performance at finer taxonomic resolutions.

Most importantly, we evaluate all the algorithms by introducing an “unknown” class and show that the modified version of PhymmBL has similar or better overall classification performance than the other modified algorithms, especially at the species level and for ultrashort reads.

Finally, we evaluate the performance of several algorithms on a real acid mine drainage dataset.

American Psychological Association (APA) citation style

Rosen, Gail L. & Polikar, Robi & Caseiro, Diamantino A. & Essinger, Steven D. & Sokhansanj, Bahrad A. 2011. Discovering the Unknown: Improving Detection of Novel Species and Genera from Short Reads. BioMed Research International, Vol. 2011, no. 2011, pp. 1-11.
https://search.emarefa.net/detail/BIM-990153

Modern Language Association (MLA) citation style

Rosen, Gail L.…[et al.]. Discovering the Unknown: Improving Detection of Novel Species and Genera from Short Reads. BioMed Research International No. 2011 (2011), pp. 1-11.
https://search.emarefa.net/detail/BIM-990153

American Medical Association (AMA) citation style

Rosen, Gail L. & Polikar, Robi & Caseiro, Diamantino A. & Essinger, Steven D. & Sokhansanj, Bahrad A. Discovering the Unknown: Improving Detection of Novel Species and Genera from Short Reads. BioMed Research International. 2011. Vol. 2011, no. 2011, pp. 1-11.
https://search.emarefa.net/detail/BIM-990153

Data type

Articles

Text language

English

Notes

Includes bibliographical references

Record number

BIM-990153