An Improved Speech Segmentation and Clustering Algorithm Based on SOM and K-Means
Joint Authors
Source
Mathematical Problems in Engineering
Issue
Vol. 2020, Issue 2020 (31 Dec. 2020), pp. 1-19, 19 p.
Publisher
Hindawi Publishing Corporation
Publication Date
2020-09-12
Country of Publication
Egypt
No. of Pages
19
Main Subjects
Abstract EN
This paper studies the segmentation and clustering of speaker speech.
To improve the accuracy of speech endpoint detection, a new endpoint detection algorithm is proposed: the short-time average zero-crossing rate used in the traditional double-threshold method is replaced by the spectral centroid, which serves as a better feature, and the thresholds are selected from the local maxima of the histogram of the statistical feature sequence.
Compared with the traditional double-threshold algorithm, it effectively improves detection accuracy and noise robustness at low SNR.
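The feature and threshold selection described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the bin count, and the weighted combination of the first two histogram peaks (with a hypothetical `weight` parameter) are assumptions, since the abstract does not state how the local maxima are turned into a threshold.

```python
import numpy as np

def spectral_centroid(frames, sample_rate):
    """Per-frame spectral centroid in Hz for an array of shape (n_frames, frame_len)."""
    spectrum = np.abs(np.fft.rfft(frames, axis=1))
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sample_rate)
    total = spectrum.sum(axis=1)
    # Silent frames carry no energy; their centroid is reported as 0.
    return (spectrum @ freqs) / np.where(total > 0, total, 1.0)

def histogram_threshold(values, bins=20, weight=5.0):
    """Pick a threshold from the first two local maxima of the value histogram.

    Assumption: the threshold is (weight * m1 + m2) / (weight + 1), where m1 and
    m2 are the bin centers of the first two histogram peaks.
    """
    counts, edges = np.histogram(values, bins=bins)
    centers = 0.5 * (edges[:-1] + edges[1:])
    padded = np.concatenate(([-1], counts, [-1]))
    # A non-empty bin is a peak if it exceeds its left neighbour and is not
    # smaller than its right neighbour (the leftmost bin of a plateau wins).
    is_peak = (padded[1:-1] > padded[:-2]) & (padded[1:-1] >= padded[2:]) & (counts > 0)
    peaks = centers[is_peak]
    if len(peaks) < 2:
        # Fallback (also an assumption): use the mean when the histogram is unimodal.
        return float(np.mean(values))
    return float((weight * peaks[0] + peaks[1]) / (weight + 1.0))
```

Frames whose feature value exceeds the selected threshold would then be treated as speech candidates; the same threshold routine could be applied to a short-time energy sequence to obtain the second threshold of a double-threshold scheme.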
The conventional k-means clustering algorithm requires the number of clusters to be specified in advance and is strongly affected by the choice of initial cluster centers.
At the same time, the self-organizing neural network algorithm converges slowly and cannot provide accurate clustering information.
An improved k-means speaker clustering algorithm based on self-organizing neural network is proposed.
The number of clusters is predicted from which competitive neurons win in the trained network, and the weights of those neurons are used as the initial cluster centers of the k-means algorithm.
The experimental results of multiperson mixed speech segmentation show that the proposed algorithm can effectively improve the accuracy of speech clustering and make up for the shortcomings of the k-means algorithm and self-organizing neural network algorithm.
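The combination of a self-organizing network with k-means described in the abstract can be sketched as below. This is a hedged illustration under stated assumptions rather than the paper's algorithm: the one-dimensional neuron grid, the linear decay schedules, and the hypothetical `min_share` rule (a neuron counts as a cluster if it wins at least that fraction of the samples) are all choices made here for the example.

```python
import numpy as np

def train_som(data, n_neurons=6, epochs=30, lr0=0.5, seed=0):
    """Train a 1-D self-organizing map; returns weights of shape (n_neurons, n_features)."""
    rng = np.random.default_rng(seed)
    weights = data[rng.choice(len(data), n_neurons, replace=False)].astype(float)
    grid = np.arange(n_neurons)
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                           # decaying learning rate
        sigma = max(n_neurons / 2.0 * (1.0 - frac), 0.5)  # shrinking neighbourhood
        for x in data[rng.permutation(len(data))]:
            winner = np.argmin(np.linalg.norm(weights - x, axis=1))
            influence = np.exp(-((grid - winner) ** 2) / (2.0 * sigma ** 2))
            weights += lr * influence[:, None] * (x - weights)
    return weights

def winners(data, centers):
    """Index of the nearest center (winning neuron) for every sample."""
    dists = np.linalg.norm(data[:, None, :] - centers[None, :, :], axis=2)
    return np.argmin(dists, axis=1)

def som_seeded_kmeans(data, min_share=0.1, iters=50, **som_kwargs):
    """Estimate k from SOM win counts, then run k-means from the winning weights."""
    weights = train_som(data, **som_kwargs)
    wins = np.bincount(winners(data, weights), minlength=len(weights))
    keep = wins >= min_share * len(data)
    if not keep.any():
        keep[np.argmax(wins)] = True   # always keep at least one neuron
    centers = weights[keep].copy()
    for _ in range(iters):             # standard Lloyd iterations
        labels = winners(data, centers)
        new_centers = np.array([
            data[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(len(centers))
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, winners(data, centers)
```

The estimated number of clusters is `len(centers)`. It depends on `min_share` and on the size of the map: when several neurons settle inside one true cluster, the count can overshoot, so a merging step or a tuned threshold would likely be needed in practice.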
American Psychological Association (APA) citation style
Jiang, Nan & Liu, Ting. 2020. An Improved Speech Segmentation and Clustering Algorithm Based on SOM and K-Means. Mathematical Problems in Engineering, Vol. 2020, no. 2020, pp. 1-19.
https://search.emarefa.net/detail/BIM-1194532
Modern Language Association (MLA) citation style
Jiang, Nan & Liu, Ting. An Improved Speech Segmentation and Clustering Algorithm Based on SOM and K-Means. Mathematical Problems in Engineering No. 2020 (2020), pp. 1-19.
https://search.emarefa.net/detail/BIM-1194532
American Medical Association (AMA) citation style
Jiang, Nan & Liu, Ting. An Improved Speech Segmentation and Clustering Algorithm Based on SOM and K-Means. Mathematical Problems in Engineering. 2020. Vol. 2020, no. 2020, pp. 1-19.
https://search.emarefa.net/detail/BIM-1194532
Data Type
Articles
Text Language
English
Notes
Includes bibliographical references
Record ID
BIM-1194532