Environmental Sound Recognition Using Time-Frequency Intersection Patterns
Joint Authors
Huang, Jie
Toyoda, Yoshiyuki
Guo, Xuan
Ding, Shuxue
Li, Huankang
Liu, Yong
Source
Applied Computational Intelligence and Soft Computing
Issue
Vol. 2012, Issue 2012 (31 Dec. 2012), pp.1-6, 6 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2012-05-10
Country of Publication
Egypt
No. of Pages
6
Main Subjects
Information Technology and Computer Science
Abstract EN
Environmental sound recognition is an important function of robots and intelligent computer systems.
In this research, we use a multistage perceptron neural network system for environmental sound recognition.
The input data is a combination of time-variance pattern of instantaneous powers and frequency-variance pattern with instantaneous spectrum at the power peak, referred to as a time-frequency intersection pattern.
Spectra of many environmental sounds change more slowly than those of speech or voice, so the intersectional time-frequency pattern will preserve the major features of environmental sounds but with drastically reduced data requirements.
Two experiments were conducted using an original database and an open database created by the RWCP project.
The recognition rate for 20 kinds of environmental sounds was 92%.
The recognition rate of the new method was about 12% higher than methods using only an instantaneous spectrum.
The results are also comparable with HMM-based methods, although those methods need to treat the time variance of an input vector series with more complicated computations.
American Psychological Association (APA)
Guo, Xuan& Toyoda, Yoshiyuki& Li, Huankang& Huang, Jie& Ding, Shuxue& Liu, Yong. 2012. Environmental Sound Recognition Using Time-Frequency Intersection Patterns. Applied Computational Intelligence and Soft Computing،Vol. 2012, no. 2012, pp.1-6.
https://search.emarefa.net/detail/BIM-488251
Modern Language Association (MLA)
Guo, Xuan…[et al.]. Environmental Sound Recognition Using Time-Frequency Intersection Patterns. Applied Computational Intelligence and Soft Computing No. 2012 (2012), pp.1-6.
https://search.emarefa.net/detail/BIM-488251
American Medical Association (AMA)
Guo, Xuan& Toyoda, Yoshiyuki& Li, Huankang& Huang, Jie& Ding, Shuxue& Liu, Yong. Environmental Sound Recognition Using Time-Frequency Intersection Patterns. Applied Computational Intelligence and Soft Computing. 2012. Vol. 2012, no. 2012, pp.1-6.
https://search.emarefa.net/detail/BIM-488251
Data Type
Journal Articles
Language
English
Notes
Includes bibliographical references
Record ID
BIM-488251