Papers

Signal and feature domain enhancement approaches for robust speech recognition

International Conference
2011~2015
작성자
한혜원
작성일
2011-12-13 00:15
조회
204
Authors : Jinkyu Lee, Soonho Baek, Hong-Goo Kang

Year : 2011

Publisher / Conference : 8th International Conference on Information, communications and Signal Processing

This paper analyzes the impact of various preprocessing modules to improve the performance of automatic speech recognition system (ASR) in noisy environment. After choosing the state-of-the-art algorithms designed in the signal domain and feature domain, their performances in various noise conditions are thoroughly evaluated. Since the enhancement has been directly made to the features that are actually used for recognition, the feature domain approach is more appropriate than the signal domain approach. Experimental results show that the noise reduction in the feature domain gives the best performance.
전체 327
197 Domestic Journal 최선웅, 현동일, 이석필, 박영철, 윤대희 "입체음향효과 향상을 위한 스테레오-10.2채널 블라인드 업믹스 기법" in 한국음향학회지, vol.31, 제 5호, pp.340-351, 2012
196 International Journal Myung-Suk Song, Hong-Goo Kang "Single-channel dereverberation using a non-causal minimum variance distortionless response filter" in The Journal of the Acoustical Society of America, vol.132, issue 1, 2012
195 International Conference Seong-woo Kim, Young-cheol Park, Dae Hee Youn "A variable step-size filtered-x gradient adaptive lattice algorithm for active noise control" in ICASSP, 2012
194 International Journal Jung-Won Lee, Jeung-Yoon Choi, Hong-Goo Kang "Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone" in The Journal of the Acoustical Society of America, vol.131, issue 2, 2012
193 International Conference Se-Woon Jeon, Young-cheol Park, Dae Hee Youn "Acoustic depth rendering for 3D multimedia applications" in ICCE, 2012
192 International Journal Jae-Mo Yang, Hong-Goo Kang "Two-stage source tracking method using a multiple linear regression model in the expanded phase domain" in EURASIP Journal on Advances in Signal Processing, vol.5, 2012
191 International Conference Jinkyu Lee, Soonho Baek, Hong-Goo Kang "Signal and feature domain enhancement approaches for robust speech recognition" in 8th International Conference on Information, communications and Signal Processing, 2011
190 International Journal Min-Seok Choi, Hong-Goo Kang "Transient noise reduction in speech signal with a modified long-term predictor" in EURASIP Journal on Advances in Signal Processing, vol.141, 2011
189 International Journal Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "An Interactive 3-D Audio System With Loudspeakers" in IEEE Transactions on Multimedia, vol.13, issue 5, pp.844-855, 2011
188 International Conference Dong-il Hyun, Young-cheol Park, Seok-pil Lee, Dae Hee Youn "Enhanced Interchannel Correlation (ICC) Synthesis for Spatial Audio Coding" in AES 43th International Conference, 2011