Papers

Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition

Domestic Journal
2006~2010
작성자
한혜원
작성일
2010-08-01 01:11
조회
3415
Authors : Myung-Suk Song, Chang-Heon Lee, Seok-Pil Lee, Hong-Goo Kang

Year : 2010

Publisher / Conference : 한국음향학회지

Volume : 29, 제 2호

Page : 86-99

This paper analyzes the performance of various single channel speech enhancement algorithms when they are applied to automatic speech recognition (ASR) systems as a preprocessor. The functional modules of speech enhancement systems are first divided into four major modules such as a gain estimator, a noise power spectrum estimator, a priori signal to noise ratio (SNR) estimator, and a speech absence probability (SAP) estimator. We investigate the relationship between speech recognition accuracy and the roles of each module. Simulation results show that the Wiener filter outperforms other gain functions such as minimum mean square error-short time spectral amplitude (MMSE-STSA) and minimum mean square error-log spectral amplitude (MMSE-LSA) estimators when a perfect noise estimator is applied. When the performance of the noise estimator degrades, however, MMSE methods including the decision directed module to estimate a priori SNR and the SAP estimation module helps to improve the performance of the enhancement algorithm for speech recognition systems.
전체 371
49 International Journal Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "An Interactive 3-D Audio System With Loudspeakers" in IEEE Transactions on Multimedia, vol.13, issue 5, pp.844-855, 2011
48 International Journal Chi-Sang Jung, Hyunson Seo, Hong-Goo Kang "Estimating Redundancy Information of Selected Features in Multi-dimensional Pattern Classification" in Pattern Recognition Letters, vol.32, issue 4, pp.590-596, 2011
47 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
46 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERPSEECH, pp.2754-2757, 2010
45 International Journal Min-Seok Choi, Hong-Goo Kang "A Two-channel Noise Estimator for Speech Enhancement in Highly Non-stationary Environment" in IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue 4, pp.905-915, 2011
44 International Journal Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang "Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information" in IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue 6, pp.1332-1340, 2010
43 Domestic Journal Myung-Suk Song, Chang-Heon Lee, Seok-Pil Lee, Hong-Goo Kang "Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition" in 한국음향학회지, vol.29, 제 2호, pp.86-99, 2010
42 International Conference Myung-Suk Song , Cha Zhang, Dinei Florencio, Hong-Goo Kang "Personal 3D audio system with loudspeakers" in ICME, 2010
41 International Conference Ho Seon Shin, Min-Seok Choi, Taesu Kim, Hong-Goo Kang "Binaural loudness based speech reinforcement with a closed-form solution" in ICASSP, 2010
40 International Journal Bong-Jin Lee, Chi-Sang Jung, Jeung-Yoon Choi, Hong-Goo Kang "On the Importance of Transition Regions for Automatic Speaker Recognition" in IEICE Transactions on Information and Systems, vol.E93-D, No.1, pp.197-200, 2010