Long Short-Term Memory Projection Recurrent Neural Network Architectures for Piano’s Continuous Note Recognition

المؤلفون المشاركون

Xu, Yanyan
Su, Kaile
Jia, YuKang
Wu, Zhicheng
Ke, Dengfeng

المصدر

Journal of Robotics

العدد

المجلد 2017، العدد 2017 (31 ديسمبر/كانون الأول 2017)، ص ص. 1-7، 7ص.

الناشر

Hindawi Publishing Corporation

تاريخ النشر

2017-09-12

دولة النشر

مصر

عدد الصفحات

7

التخصصات الرئيسية

هندسة ميكانيكية

الملخص EN

Long Short-Term Memory (LSTM) is a kind of Recurrent Neural Networks (RNN) relating to time series, which has achieved good performance in speech recogniton and image recognition.

Long Short-Term Memory Projection (LSTMP) is a variant of LSTM to further optimize speed and performance of LSTM by adding a projection layer.

As LSTM and LSTMP have performed well in pattern recognition, in this paper, we combine them with Connectionist Temporal Classification (CTC) to study piano’s continuous note recognition for robotics.

Based on the Beijing Forestry University music library, we conduct experiments to show recognition rates and numbers of iterations of LSTM with a single layer, LSTMP with a single layer, and Deep LSTM (DLSTM, LSTM with multilayers).

As a result, the single layer LSTMP proves performing much better than the single layer LSTM in both time and the recognition rate; that is, LSTMP has fewer parameters and therefore reduces the training time, and, moreover, benefiting from the projection layer, LSTMP has better performance, too.

The best recognition rate of LSTMP is 99.8%.

As for DLSTM, the recognition rate can reach 100% because of the effectiveness of the deep structure, but compared with the single layer LSTMP, DLSTM needs more training time.

نمط استشهاد جمعية علماء النفس الأمريكية (APA)

Jia, YuKang& Wu, Zhicheng& Xu, Yanyan& Ke, Dengfeng& Su, Kaile. 2017. Long Short-Term Memory Projection Recurrent Neural Network Architectures for Piano’s Continuous Note Recognition. Journal of Robotics،Vol. 2017, no. 2017, pp.1-7.
https://search.emarefa.net/detail/BIM-1186322

نمط استشهاد الجمعية الأمريكية للغات الحديثة (MLA)

Jia, YuKang…[et al.]. Long Short-Term Memory Projection Recurrent Neural Network Architectures for Piano’s Continuous Note Recognition. Journal of Robotics No. 2017 (2017), pp.1-7.
https://search.emarefa.net/detail/BIM-1186322

نمط استشهاد الجمعية الطبية الأمريكية (AMA)

Jia, YuKang& Wu, Zhicheng& Xu, Yanyan& Ke, Dengfeng& Su, Kaile. Long Short-Term Memory Projection Recurrent Neural Network Architectures for Piano’s Continuous Note Recognition. Journal of Robotics. 2017. Vol. 2017, no. 2017, pp.1-7.
https://search.emarefa.net/detail/BIM-1186322

نوع البيانات

مقالات

لغة النص

الإنجليزية

الملاحظات

Includes bibliographical references

رقم السجل

BIM-1186322