Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition

Domestic Journal
2010-08-01 01:11
Authors : Myung-Suk Song, Chang-Heon Lee, Seok-Pil Lee, Hong-Goo Kang

Year : 2010

Publisher / Conference : 한국음향학회지

Volume : 29, 제 2호

Page : 86-99

This paper analyzes the performance of various single channel speech enhancement algorithms when they are applied to automatic speech recognition (ASR) systems as a preprocessor. The functional modules of speech enhancement systems are first divided into four major modules such as a gain estimator, a noise power spectrum estimator, a priori signal to noise ratio (SNR) estimator, and a speech absence probability (SAP) estimator. We investigate the relationship between speech recognition accuracy and the roles of each module. Simulation results show that the Wiener filter outperforms other gain functions such as minimum mean square error-short time spectral amplitude (MMSE-STSA) and minimum mean square error-log spectral amplitude (MMSE-LSA) estimators when a perfect noise estimator is applied. When the performance of the noise estimator degrades, however, MMSE methods including the decision directed module to estimate a priori SNR and the SAP estimation module helps to improve the performance of the enhancement algorithm for speech recognition systems.
전체 327
128 Domestic Journal Myung-Suk Song, Chang-Heon Lee, Seok-Pil Lee, Hong-Goo Kang "Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition" in 한국음향학회지, vol.29, 제 2호, pp.86-99, 2010
127 International Conference Myung-Suk Song , Cha Zhang, Dinei Florencio, Hong-Goo Kang "Personal 3D audio system with loudspeakers" in ICME, 2010
126 Domestic Journal 정재웅, 박영철, 윤대희, 이석필 "주파수 워핑된 공통 극점을 이용한 음향 간섭제거기의 설계 및 구현" in 한국음향학회지, vol.29, 제 5호, pp.339-346, 2010
125 Domestic Conference 백순호,양재모,이석필,강홍구 "주파수 밴드 별 엔트로피 변화에 따른 잔향제거 기술" in 한국통신학회, 2010
124 Domestic Conference 전세운,박영철,이석필,윤대희 "방향 정보에 따른 가변적인 채널 게인 제어를 통한 다채널 벡터 기반의 사운드 패닝 기술에 관한 연구" in 한국통신학회, 2010
123 International Conference Se-Woon Jeon, Young-Cheol Park, Seok-Pil Lee, Dae Hee Youn "Robust Representation of Spatial Sound in Stereo-to-Multichannel Upmix" in 128th Convention of Audio Engineering Society, pp.7976, 2010
122 International Conference Se-Woon Jeon, Dongil Hyun, Jeongil Seo, Young-Cheol Park, Dae Hee Youn "Enhancement of principal to ambient energy ratio for PCA-based parametric audio coding" in ICASSP, 2010
121 International Conference Ho Seon Shin, Min-Seok Choi, Taesu Kim, Hong-Goo Kang "Binaural loudness based speech reinforcement with a closed-form solution" in ICASSP, 2010
120 Domestic Conference 김성우, 박영철, 윤대희, 조점군 "소음 환경에서 다양한 빔 형성기법들의 성능 평가" in 2010 국방과학40주년학술대회, 2010
119 Domestic Conference 양재모,강홍구 "중계기시스템환경에서 피드백 신호 제거를 위한 DCT-LMS 적응필터 알고리즘" in 한국통신학회, 2010