Papers

Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

International Conference
2011~2015
작성자
한혜원
작성일
2014-05-01 00:42
조회
1281
Authors : Soonho Baek, Hong-Goo Kang

Year : 2014

Publisher / Conference : ICASSP

This paper presents the effect of mean normalization to various types of cepstral coefficients for robust speech recognition in noisy environments. Although the cepstral mean normalization (CMN) technique was originally designed to compensate channel distortion, it has also been proved that the CMN also improves recognition accuracy in additive noisy environment. However, no one has yet considered the interaction of CMN with spectral mapping functions required for extracting cepstral features. This paper investigates the impact of CMN to the speech recognition system depending on the types of spectral mapping function by mathematically analyzing the amount of spectral distortion between clean and noisy conditions. The analytic result is also confirmed by comparing the type of recognition error patterns in automatic speech recognition experiment with Aurora 2 database. Experimental results show that the performance improvement by adopting CMN becomes significant if the logarithmic function is replaced with the appropriate setting of fractional power mapping function. Especially, the deletion errors are dramatically reduced.
전체 355
62 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue 3, pp.411-419, 2014
61 International Conference Eunwoo Song, Hong-Goo Kang, Joonil Lee "Fixed-point implementation of MPEG-D unified speech and audio coding decoder" in 19th International Conference on Digital Signal Processing (DSP), pp.110-113, 2014
60 International Journal Soonho Baek, Hong-Goo Kang "Selection of spectral compressive operator for vector Taylor series-based model adaptation in noisy environments" in The Journal of the Acoustical Society of America, vol.135, 2014
59 International Conference Soonho Baek, Hong-Goo Kang "Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment" in ICASSP, 2014
58 International Journal Jae-Mo Yang, Hong-Goo Kang "Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction" in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue 3, pp.608-619, 2014
57 International Conference Ho Seon Shin, Hong-Goo Kang "Bone-Conduction Speech Enhancement using a Speaker-Independent Filter" in ICEIC, 2014
56 Domestic Journal 오현오, 이태규, 전세운, 윤대희, 박영철, 서정일, 이용주 "모바일 3D 사운드 : 바이노럴 오디오 기술 동향" in 방송공학회논문지, vol.19, 제 1호, pp.65-74, 2014
55 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue.3, pp.411-419, 2014
54 International Journal Jae-Mo Yang, Hong-Goo Kang "An Efficient Multichannel Linear Prediction-Based Blind Equalization Algorithm in Near Common Zeros Condition" in IEEE Signal Processing Letters, vol.21, issue 3, pp.306-310, 2014
53 Domestic Journal 현동일, 박영철, 윤대희 "파라메트릭 스테레오 오디오 부호화를 위한 향상된 위상 합성 기법" in 전자공학회논문지, vol.50, 제 12호, pp.184-190, 2013