Papers

Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

International Conference
2011~2015
Posted by: 한혜원
Date: 2014-05-01 00:42
Authors : Soonho Baek, Hong-Goo Kang

Year : 2014

Publisher / Conference : ICASSP

This paper presents the effect of mean normalization on various types of cepstral coefficients for robust speech recognition in noisy environments. Although the cepstral mean normalization (CMN) technique was originally designed to compensate for channel distortion, it has also been shown to improve recognition accuracy in additive noise environments. However, the interaction of CMN with the spectral mapping functions required for extracting cepstral features has not yet been considered. This paper investigates the impact of CMN on the speech recognition system depending on the type of spectral mapping function by mathematically analyzing the amount of spectral distortion between clean and noisy conditions. The analytic result is also confirmed by comparing the types of recognition error patterns in automatic speech recognition experiments with the Aurora 2 database. Experimental results show that the performance improvement from adopting CMN becomes significant if the logarithmic function is replaced with an appropriately configured fractional power mapping function. In particular, deletion errors are dramatically reduced.
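As a rough illustration of the pipeline the abstract refers to, the following sketch computes cepstral coefficients from filterbank energies with either a logarithmic or a fractional power mapping, then applies CMN by subtracting the per-utterance mean. It is not the authors' implementation: the function names, the exponent 0.1, the 23-band filterbank, and the 13 cepstral coefficients are all assumptions chosen for illustration, and the abstract does not state the paper's settings.

```python
import numpy as np


def dct_ii(x, n_ceps):
    """Unnormalized type-II DCT over the last axis, keeping n_ceps coefficients."""
    n_bands = x.shape[-1]
    k = np.arange(n_ceps)[:, None]    # cepstral index
    n = np.arange(n_bands)[None, :]   # filterbank band index
    basis = np.cos(np.pi * k * (n + 0.5) / n_bands)
    return x @ basis.T


def cepstra(fbank_energies, mapping="log", power=0.1, n_ceps=13):
    """Map filterbank energies (frames x bands) to cepstral coefficients.

    mapping="log" uses the conventional logarithm; mapping="power" uses a
    fractional power function. The default exponent is a hypothetical value.
    """
    e = np.maximum(fbank_energies, 1e-10)  # floor to avoid log(0)
    if mapping == "log":
        mapped = np.log(e)
    elif mapping == "power":
        mapped = e ** power
    else:
        raise ValueError(f"unknown mapping: {mapping}")
    return dct_ii(mapped, n_ceps)


def cmn(c):
    """Cepstral mean normalization: subtract the mean over frames."""
    return c - c.mean(axis=0, keepdims=True)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    energies = rng.uniform(0.1, 10.0, size=(50, 23))  # synthetic stand-in data
    log_feats = cmn(cepstra(energies, mapping="log"))
    pow_feats = cmn(cepstra(energies, mapping="power", power=0.1))
    print(log_feats.shape, pow_feats.shape)
```

With the logarithmic mapping, a fixed per-band channel gain becomes a constant additive offset in the cepstral domain, so the mean subtraction removes it exactly; this is the channel-compensation role of CMN mentioned above. Additive noise does not reduce to a constant offset under either mapping, which is why the choice of mapping function interacts with CMN.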
Total: 372
272 International Conference Seung-chul Shin, Sangyeop Lee, Taeho Lee, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang "Two electrode based healthcare device for continuously monitoring ECG and BIA signals" in BHI, 2018
271 International Journal JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "Generic uniform search grid generation algorithm for far-field source localization" in The Journal of the Acoustical Society of America, vol.143, 2018
270 International Journal Min-Jae Hwang, JeeSok Lee, MiSuk Lee, Hong-Goo Kang "SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals" in IEEE Transactions on Multimedia, vol.20, issue 1, pp.45-54, 2018
269 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems" in ASRU, 2017
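268 Domestic Conference 양해민, 강홍구 "잡음 예측을 위한 심층 신경망기반 음성 존재 확률 계산법" [Deep neural network-based speech presence probability calculation method for noise prediction] in 대한전자공학회 추계학술대회 [IEIE Fall Conference], 2017
267 Domestic Conference 오상신, 정수환, 강홍구 "음성 인식 기반의 방송미디어 디바이스 제어 및 편집 시스템 구현" [Implementation of a speech recognition-based broadcast media device control and editing system] in 대한전자공학회 추계학술대회 [IEIE Fall Conference], 2017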
266 International Journal Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems" in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue 11, pp.2152-2161, 2017
265 International Conference Seung-chul Shin, Junhyung Moon, Saewon Kye, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang "Continuous bladder volume monitoring system for wearable applications" in EMBC, 2017
264 Domestic Conference 김정규, 박영철, 강홍구 "저사양 TV 사운드 설계환경을 위한 IIR 필터 기반 주파수 등화기" [IIR filter-based frequency equalizer for low-spec TV sound design environments] in 대한전자공학회 학술대회 [IEIE Conference], 2017
263 International Conference Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang "Deep bi-directional long short-term memory based speech enhancement for wind noise reduction" in HSCMA, 2017