Papers

Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition

Domestic Journal
2006~2010
작성자
한혜원
작성일
2010-08-01 01:11
조회
4812
Authors : Myung-Suk Song, Chang-Heon Lee, Seok-Pil Lee, Hong-Goo Kang

Year : 2010

Publisher / Conference : 한국음향학회지

Volume : 29, 제 2호

Page : 86-99

This paper analyzes the performance of various single channel speech enhancement algorithms when they are applied to automatic speech recognition (ASR) systems as a preprocessor. The functional modules of speech enhancement systems are first divided into four major modules such as a gain estimator, a noise power spectrum estimator, a priori signal to noise ratio (SNR) estimator, and a speech absence probability (SAP) estimator. We investigate the relationship between speech recognition accuracy and the roles of each module. Simulation results show that the Wiener filter outperforms other gain functions such as minimum mean square error-short time spectral amplitude (MMSE-STSA) and minimum mean square error-log spectral amplitude (MMSE-LSA) estimators when a perfect noise estimator is applied. When the performance of the noise estimator degrades, however, MMSE methods including the decision directed module to estimate a priori SNR and the SAP estimation module helps to improve the performance of the enhancement algorithm for speech recognition systems.
전체 377
177 International Conference Jeongook Song, Hyen-o Oh, Hong-Goo Kong "Enhanced long-term predictor for Unified Speech and Audio Coding" in ICASSP, 2011
176 Domestic Conference 노훈동,김성우,이충용,윤대희 "근거리장 방위각 추정을 위한 원거리장 근사화 기법 성능 평가 및 분석" in 한국음향학회, 2011
175 Domestic Journal 전세운, 박영철, 이석필, 윤대희 "다채널 스피커 환경에서 가상 음원을 생성하기 위한 레벨 패닝 알고리즘" in 한국음향학회지, vol.30, 제 4호, pp.197-206, 2011
174 Domestic Journal Yoomi Hur, Young-Cheol Park, Seok-Pil Lee, Dae Hee Youn "Efficient Individualization Method of HRTFs Using Critical-band Based Spectral Cue Control" in 한국음향학회지, vol.30, 제 4호, pp.167-180, 2011
173 International Journal Chi-Sang Jung, Hyunson Seo, Hong-Goo Kang "Estimating Redundancy Information of Selected Features in Multi-dimensional Pattern Classification" in Pattern Recognition Letters, vol.32, issue 4, pp.590-596, 2011
172 Domestic Journal 최민석, 신호선, 황영수, 강홍구 "음성 신호에서의 시간-주파수 축 충격 잡음 검출 시스템" in 한국음향학회지, vol.30, 제 2호, pp.73-79, 2011
171 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
170 International Journal Dong-il Hyun, Donggeum Lee, Youngcheol Park, Dae Hee Youn, Jeongil Seo "Joint Channel Coding Based on Principal Component Analysis" in ETRI Journal, vol.32, issue 5, pp.831-834, 2010
169 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERPSEECH, pp.2754-2757, 2010
168 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010