Error Analysis of Deep Sequencing of Phage Libraries : Peptides Censored in Sequencing

Joint Authors

Matochko, Wadim L.
Derda, Ratmir

Source

Computational and Mathematical Methods in Medicine

Issue

Vol. 2013, Issue 2013 (31 Dec. 2013), pp.1-13, 13 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2013-12-12

Country of Publication

Egypt

No. of Pages

13

Main Subjects

Medicine

Abstract EN

Next-generation sequencing techniques empower selection of ligands from phage-display libraries because they can detect low abundant clones and quantify changes in the copy numbers of clones without excessive selection rounds.

Identification of errors in deep sequencing data is the most critical step in this process because these techniques have error rates >1%.

Mechanisms that yield errors in Illumina and other techniques have been proposed, but no reports to date describe error analysis in phage libraries.

Our paper focuses on error analysis of 7-mer peptide libraries sequenced by Illumina method.

Low theoretical complexity of this phage library, as compared to complexity of long genetic reads and genomes, allowed us to describe this library using convenient linear vector and operator framework.

We describe a phage library as N×1 frequency vector n=ni, where ni is the copy number of the ith sequence and N is the theoretical diversity, that is, the total number of all possible sequences.

Any manipulation to the library is an operator acting on n.

Selection, amplification, or sequencing could be described as a product of a N×N matrix and a stochastic sampling operator (Sa).

The latter is a random diagonal matrix that describes sampling of a library.

In this paper, we focus on the properties of Sa and use them to define the sequencing operator (Seq).

Sequencing without any bias and errors is Seq=Sa IN, where IN is a N×N unity matrix.

Any bias in sequencing changes IN to a nonunity matrix.

We identified a diagonal censorship matrix (CEN), which describes elimination or statistically significant downsampling, of specific reads during the sequencing process.

American Psychological Association (APA)

Matochko, Wadim L.& Derda, Ratmir. 2013. Error Analysis of Deep Sequencing of Phage Libraries : Peptides Censored in Sequencing. Computational and Mathematical Methods in Medicine،Vol. 2013, no. 2013, pp.1-13.
https://search.emarefa.net/detail/BIM-475872

Modern Language Association (MLA)

Matochko, Wadim L.& Derda, Ratmir. Error Analysis of Deep Sequencing of Phage Libraries : Peptides Censored in Sequencing. Computational and Mathematical Methods in Medicine No. 2013 (2013), pp.1-13.
https://search.emarefa.net/detail/BIM-475872

American Medical Association (AMA)

Matochko, Wadim L.& Derda, Ratmir. Error Analysis of Deep Sequencing of Phage Libraries : Peptides Censored in Sequencing. Computational and Mathematical Methods in Medicine. 2013. Vol. 2013, no. 2013, pp.1-13.
https://search.emarefa.net/detail/BIM-475872

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-475872