A Novel Bioinformatics Method for Efficient Knowledge Discovery by BLSOM from Big Genomic Sequence Data

Joint Authors

Iwasaki, Yuki
Ikemura, Toshimichi
Kanaya, Shigehiko
Bai, Yu
Zhao, Yue

Source

BioMed Research International

Issue

Vol. 2014, Issue 2014 (31 Dec. 2014), pp.1-11, 11 p.

Publisher

Hindawi Publishing Corporation

Publication Date

2014-04-03

Country of Publication

Egypt

No. of Pages

11

Main Subjects

Medicine

Abstract EN

With remarkable increase of genomic sequence data of a wide range of species, novel tools are needed for comprehensive analyses of the big sequence data.

Self-Organizing Map (SOM) is an effective tool for clustering and visualizing high-dimensional data such as oligonucleotide composition on one map.

By modifying the conventional SOM, we have previously developed Batch-Learning SOM (BLSOM), which allows classification of sequence fragments according to species, solely depending on the oligonucleotide composition.

In the present study, we introduce the oligonucleotide BLSOM used for characterization of vertebrate genome sequences.

We first analyzed pentanucleotide compositions in 100 kb sequences derived from a wide range of vertebrate genomes and then the compositions in the human and mouse genomes in order to investigate an efficient method for detecting differences between the closely related genomes.

BLSOM can recognize the species-specific key combination of oligonucleotide frequencies in each genome, which is called a “genome signature,” and the specific regions specifically enriched in transcription-factor-binding sequences.

Because the classification and visualization power is very high, BLSOM is an efficient powerful tool for extracting a wide range of information from massive amounts of genomic sequences (i.e., big sequence data).

American Psychological Association (APA)

Bai, Yu& Iwasaki, Yuki& Kanaya, Shigehiko& Zhao, Yue& Ikemura, Toshimichi. 2014. A Novel Bioinformatics Method for Efficient Knowledge Discovery by BLSOM from Big Genomic Sequence Data. BioMed Research International،Vol. 2014, no. 2014, pp.1-11.
https://search.emarefa.net/detail/BIM-497086

Modern Language Association (MLA)

Bai, Yu…[et al.]. A Novel Bioinformatics Method for Efficient Knowledge Discovery by BLSOM from Big Genomic Sequence Data. BioMed Research International No. 2014 (2014), pp.1-11.
https://search.emarefa.net/detail/BIM-497086

American Medical Association (AMA)

Bai, Yu& Iwasaki, Yuki& Kanaya, Shigehiko& Zhao, Yue& Ikemura, Toshimichi. A Novel Bioinformatics Method for Efficient Knowledge Discovery by BLSOM from Big Genomic Sequence Data. BioMed Research International. 2014. Vol. 2014, no. 2014, pp.1-11.
https://search.emarefa.net/detail/BIM-497086

Data Type

Journal Articles

Language

English

Notes

Includes bibliographical references

Record ID

BIM-497086