SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq)‎ Data

Joint Authors

Tan, Yuxiang
Tambouret, Yann
Monti, Stefano

Source

BioMed Research International

Issue

Vol. 2015, Issue 2015 (31 Dec. 2015), pp.1-5, 5 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2015-12-29

Country of Publication

Egypt

No. of Pages

5

Main Subjects

Medicine

Abstract EN

The performance evaluation of fusion detection algorithms from high-throughput sequencing data crucially relies on the availability of data with known positive and negative cases of gene rearrangements.

The use of simulated data circumvents some shortcomings of real data by generation of an unlimited number of true and false positive events, and the consequent robust estimation of accuracy measures, such as precision and recall.

Although a few simulated fusion datasets from RNA Sequencing (RNA-Seq) are available, they are of limited sample size.

This makes it difficult to systematically evaluate the performance of RNA-Seq based fusion-detection algorithms.

Here, we present SimFuse to address this problem.

SimFuse utilizes real sequencing data as the fusions’ background to closely approximate the distribution of reads from a real sequencing library and uses a reference genome as the template from which to simulate fusions’ supporting reads.

To assess the supporting read-specific performance, SimFuse generates multiple datasets with various numbers of fusion supporting reads.

Compared to an extant simulated dataset, SimFuse gives users control over the supporting read features and the sample size of the simulated library, based on which the performance metrics needed for the validation and comparison of alternative fusion-detection algorithms can be rigorously estimated.

American Psychological Association (APA)

Tan, Yuxiang& Tambouret, Yann& Monti, Stefano. 2015. SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data. BioMed Research International،Vol. 2015, no. 2015, pp.1-5.
https://search.emarefa.net/detail/BIM-1056694

Modern Language Association (MLA)

Tan, Yuxiang…[et al.]. SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data. BioMed Research International No. 2015 (2015), pp.1-5.
https://search.emarefa.net/detail/BIM-1056694

American Medical Association (AMA)

Tan, Yuxiang& Tambouret, Yann& Monti, Stefano. SimFuse: A Novel Fusion Simulator for RNA Sequencing (RNA-Seq) Data. BioMed Research International. 2015. Vol. 2015, no. 2015, pp.1-5.
https://search.emarefa.net/detail/BIM-1056694

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-1056694