Papers

Signal and feature domain enhancement approaches for robust speech recognition

International Conference
2011~2015
작성자
한혜원
작성일
2011-12-13 00:15
조회
1301
Authors : Jinkyu Lee, Soonho Baek, Hong-Goo Kang

Year : 2011

Publisher / Conference : 8th International Conference on Information, communications and Signal Processing

This paper analyzes the impact of various preprocessing modules to improve the performance of automatic speech recognition system (ASR) in noisy environment. After choosing the state-of-the-art algorithms designed in the signal domain and feature domain, their performances in various noise conditions are thoroughly evaluated. Since the enhancement has been directly made to the features that are actually used for recognition, the feature domain approach is more appropriate than the signal domain approach. Experimental results show that the noise reduction in the feature domain gives the best performance.
전체 363
57 International Conference Se-Woon Jeon, Young-cheol Park, Dae Hee Youn "Acoustic depth rendering for 3D multimedia applications" in ICCE, 2012
56 International Conference Jinkyu Lee, Soonho Baek, Hong-Goo Kang "Signal and feature domain enhancement approaches for robust speech recognition" in 8th International Conference on Information, communications and Signal Processing, 2011
55 International Conference Dong-il Hyun, Young-cheol Park, Seok-pil Lee, Dae Hee Youn "Enhanced Interchannel Correlation (ICC) Synthesis for Spatial Audio Coding" in AES 43th International Conference, 2011
54 International Conference Se-Woon Jeon, Young-cheol Park, Seok-Pil Lee, Dae Hee Youn "Virtual Source Panning using Multiple-Wise Vector Base in the Multispeaker Stereo Format" in EUSIPCO, pp.1337-1341, 2011
53 International Conference Dong-il Hyun, Jeongil Seo, Young-cheol Park, Dae Hee Youn "Improved phase parameter analysis and synthesis for parametric stereo audio coding" in ICASSP, 2011
52 International Conference Jeongook Song, Hyen-o Oh, Hong-Goo Kong "Enhanced long-term predictor for Unified Speech and Audio Coding" in ICASSP, 2011
51 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
50 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERPSEECH, pp.2754-2757, 2010
49 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010
48 International Conference Myung-Suk Song , Cha Zhang, Dinei Florencio, Hong-Goo Kang "Personal 3D audio system with loudspeakers" in ICME, 2010