Research on Isolated Word Speech Recognition Based on DTW and EMD

doi:10.3969/j.issn.1672-6952.2018.01.013

Journal of Liaoning Petrochemical University, 2018, Vol. 38, Issue (1): 74-78. DOI: 10.3969/j.issn.1672-6952.2018.01.013

Xu Biwei¹, Su Chengli¹, Yang Wei², Cao Jiangtao¹

1. School of Information and Control Engineering, Liaoning Shihua University, Fushun, Liaoning 113001, China; 2. Liaoning Equipment Manufacturing Vocational and Technical College, Shenyang, Liaoning 110000, China

Received: 2017-05-04 Revised: 2017-05-20 Published: 2018-02-28 Online: 2018-02-27
Corresponding author: Su Chengli (b. 1977), male, Ph.D., professor; research on model predictive control and on advanced control and optimization of industrial processes. E-mail: sclwind@sina.com.
About the author: Xu Biwei (b. 1993), male, master's student; research on speech recognition. E-mail: 729255967@qq.com.
Funding: National Natural Science Foundation of China (61673199); Liaoning Province University Excellent Talents Support Program (LJQ2015061).

Abstract

To address the strong interference from environmental noise in speech recognition, an isolated word recognition algorithm combining empirical mode decomposition (EMD) with dynamic time warping (DTW) is proposed. The method first uses EMD to decompose the noisy, poor-quality speech signal into several intrinsic mode functions (IMFs) and removes the interference and noise present in the original signal. Then, within the DTW framework, the short-time zero-crossing rate and short-time energy are used for endpoint detection of the speech signal, and the extracted speech feature parameters are matched against the reference templates. The shortest path between a reference template and the test template is taken as the recognition result. Simulation results show that the algorithm improves both the efficiency and the accuracy of speech recognition.

Keywords: speech recognition, empirical mode decomposition, dynamic time warping, isolated word recognition
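
The endpoint detection and template matching steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no thresholds, frame sizes, or feature type, so the function names, the default parameter values, and the single-pass energy/zero-crossing rule below are all illustrative assumptions, and the EMD denoising stage is omitted entirely.

```python
import math

def frame_signal(x, frame_len, hop):
    """Split a sample sequence into overlapping frames (partial tail dropped)."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(s * s for s in frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def detect_endpoints(x, frame_len=4, hop=2, energy_thr=0.1, zcr_thr=0.5):
    """Return (first, last) indices of frames judged to be speech, or None.

    Assumed rule (hypothetical, simplified): a frame counts as speech if its
    energy exceeds energy_thr, or if it has a high zero-crossing rate together
    with at least a small fraction of that energy.
    """
    active = []
    for i, frame in enumerate(frame_signal(x, frame_len, hop)):
        energy = short_time_energy(frame)
        zcr = zero_crossing_rate(frame)
        if energy > energy_thr or (zcr > zcr_thr and energy > 0.1 * energy_thr):
            active.append(i)
    return (active[0], active[-1]) if active else None

def dtw_distance(a, b):
    """Cumulative cost of the cheapest warping path between two sequences
    of equal-dimension feature vectors (Euclidean local cost)."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # a advances alone
                                     cost[i][j - 1],      # b advances alone
                                     cost[i - 1][j - 1])  # both advance
    return cost[n][m]

def recognize(test, templates):
    """Return the label of the reference template with the smallest DTW distance."""
    return min(templates, key=lambda word: dtw_distance(test, templates[word]))
```

In this sketch, `templates` is a hypothetical dictionary mapping each word label to a list of per-frame feature vectors. Because DTW allows either sequence to advance alone, a time-stretched utterance can still align with its template at low cost, which is why it suits isolated words spoken at varying speeds.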

Xu Biwei, Su Chengli, Yang Wei, Cao Jiangtao. Research on Isolated Word Speech Recognition Based on DTW and EMD[J]. Journal of Liaoning Petrochemical University, 2018, 38(1): 74-78.

