Papers

Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

International Conference
2011~2015
작성자
한혜원
작성일
2014-05-01 00:42
조회
228
Authors : Soonho Baek, Hong-Goo Kang

Year : 2014

Publisher / Conference : ICASSP

This paper presents the effect of mean normalization to various types of cepstral coefficients for robust speech recognition in noisy environments. Although the cepstral mean normalization (CMN) technique was originally designed to compensate channel distortion, it has also been proved that the CMN also improves recognition accuracy in additive noisy environment. However, no one has yet considered the interaction of CMN with spectral mapping functions required for extracting cepstral features. This paper investigates the impact of CMN to the speech recognition system depending on the types of spectral mapping function by mathematically analyzing the amount of spectral distortion between clean and noisy conditions. The analytic result is also confirmed by comparing the type of recognition error patterns in automatic speech recognition experiment with Aurora 2 database. Experimental results show that the performance improvement by adopting CMN becomes significant if the logarithmic function is replaced with the appropriate setting of fractional power mapping function. Especially, the deletion errors are dramatically reduced.
전체 319
229 International Conference Soonho Baek, Hong-Goo Kang "Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment" in ICASSP, 2014
228 International Journal Jae-Mo Yang, Hong-Goo Kang "Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction" in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue 3, pp.608-619, 2014
227 International Conference Ho Seon Shin, Hong-Goo Kang "Bone-Conduction Speech Enhancement using a Speaker-Independent Filter" in ICEIC, 2014
226 Domestic Journal 오현오, 이태규, 전세운, 윤대희, 박영철, 서정일, 이용주 "모바일 3D 사운드 : 바이노럴 오디오 기술 동향" in 방송공학회논문지, vol.19, 제 1호, pp.65-74, 2014
225 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue.3, pp.411-419, 2014
224 International Journal Jae-Mo Yang, Hong-Goo Kang "An Efficient Multichannel Linear Prediction-Based Blind Equalization Algorithm in Near Common Zeros Condition" in IEEE Signal Processing Letters, vol.21, issue 3, pp.306-310, 2014
223 Domestic Journal 현동일, 박영철, 윤대희 "파라메트릭 스테레오 오디오 부호화를 위한 향상된 위상 합성 기법" in 전자공학회논문지, vol.50, 제 12호, pp.184-190, 2013
222 International Conference Soonho Baek, Hong-Goo Kang "Vector Taylor Series based HMM Adaptation for Generalized Cepstrum in Noisy Environment" in ASRU, 2013
221 Domestic Conference 변경근, 서지호, 강홍구 "스펙트로그램의 양방향 상관계수를 이용한 음질 개선" in 한국음향학회 추계학술발표대회, 2013
220 Domestic Conference 강현주, 문현기, 강홍구 "잡음 제거와 하모닉 강화 알고리듬을 이용한 통합 음질 개선 기술" in 한국음향학회 추계학술발표대회, 2013