Papers

Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition

Domestic Journal
2006~2010
작성자
한혜원
작성일
2010-08-01 01:11
조회
5974
Authors : Myung-Suk Song, Chang-Heon Lee, Seok-Pil Lee, Hong-Goo Kang

Year : 2010

Publisher / Conference : 한국음향학회지

Volume : 29, 제 2호

Page : 86-99

This paper analyzes the performance of various single channel speech enhancement algorithms when they are applied to automatic speech recognition (ASR) systems as a preprocessor. The functional modules of speech enhancement systems are first divided into four major modules such as a gain estimator, a noise power spectrum estimator, a priori signal to noise ratio (SNR) estimator, and a speech absence probability (SAP) estimator. We investigate the relationship between speech recognition accuracy and the roles of each module. Simulation results show that the Wiener filter outperforms other gain functions such as minimum mean square error-short time spectral amplitude (MMSE-STSA) and minimum mean square error-log spectral amplitude (MMSE-LSA) estimators when a perfect noise estimator is applied. When the performance of the noise estimator degrades, however, MMSE methods including the decision directed module to estimate a priori SNR and the SAP estimation module helps to improve the performance of the enhancement algorithm for speech recognition systems.
전체 381
57 International Journal Jung-In Lee, Jeung-Yoon Choi, Hong-Goo Kang "Refinement of Landmark Detection and Extraction of Articulator-Free Features for Knowledge-Based Speech Recognition" in IEICE Transactions on Information and Systems, vol.E96-D, No.3, pp.746-749, 2013
56 International Journal Chi-Sang Jung, Young-Sun Joo, Hong-Goo Kang "Waveform Interpolation-Based Speech Analysis/Synthesis for HMM-Based TTS Systems" in IEEE Signal Processing Letters, vol.19, issue 12, pp.809-812, 2012
55 International Conference Ho Seon Shin, Hong-Goo Kang, Tim Fingscheidt "Survey of Speech Enhancement Supported by a Bone Conduction Microphone" in Speech Communication; 10. ITG Symposium, 2012
54 International Journal Myung-Suk Song, Hong-Goo Kang "Single-channel dereverberation using a non-causal minimum variance distortionless response filter" in The Journal of the Acoustical Society of America, vol.132, issue 1, 2012
53 International Journal Jung-Won Lee, Jeung-Yoon Choi, Hong-Goo Kang "Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone" in The Journal of the Acoustical Society of America, vol.131, issue 2, 2012
52 International Journal Jae-Mo Yang, Hong-Goo Kang "Two-stage source tracking method using a multiple linear regression model in the expanded phase domain" in EURASIP Journal on Advances in Signal Processing, vol.5, 2012
51 International Conference Jinkyu Lee, Soonho Baek, Hong-Goo Kang "Signal and feature domain enhancement approaches for robust speech recognition" in 8th International Conference on Information, communications and Signal Processing, 2011
50 International Journal Min-Seok Choi, Hong-Goo Kang "Transient noise reduction in speech signal with a modified long-term predictor" in EURASIP Journal on Advances in Signal Processing, vol.141, 2011
49 International Journal Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "An Interactive 3-D Audio System With Loudspeakers" in IEEE Transactions on Multimedia, vol.13, issue 5, pp.844-855, 2011
48 International Journal Chi-Sang Jung, Hyunson Seo, Hong-Goo Kang "Estimating Redundancy Information of Selected Features in Multi-dimensional Pattern Classification" in Pattern Recognition Letters, vol.32, issue 4, pp.590-596, 2011