Papers

Vector Taylor Series based HMM Adaptation for Generalized Cepstrum in Noisy Environment

International Conference
2011~2015
작성자
한혜원
작성일
2013-12-01 00:37
조회
2869
Authors : Soonho Baek, Hong-Goo Kang

Year : 2013

Publisher / Conference : ASRU

This paper proposes a novel HMM adaptation algorithm for robust automatic speech recognition (ASR) system in noisy environments. The HMM adaptation using vector Taylor series (VTS) significantly improves the ASR performance in noisy environments. Recently, the power normalized cepstral coefficient (PNCC) that replaces a logarithmic mapping function with a power mapping function has been proposed and it is proved that the replacement of the mapping function is robust to additive noise. In this paper, we extend the VTS based approach to the cepstral coefficients obtained by using a power mapping function instead of a logarithmic mapping function. Experimental results indicate that HMM adaptation in the cepstrum obtained by using a power mapping function improves the ASR performance comparing the VTS based conventional approach for mel-frequency cepstral coefficients (MFCCs).
전체 370
240 International Conference Hyeonjoo Kang, JeeSok Lee, Soonho Baek, Hong-Goo Kang "Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems" in INTERSPEECH, 2015
239 International Conference Kyungguen Byun, Eunwoo Song, Hong-goo Kang "A constrained two-layer compression technique for ECG waves" in Enegineering in Medicine and Biology Society (EMBC), 2015
238 International Conference Eunwoo Song, Hong-Goo Kang "Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo" in INTERSPEECH, 2015
237 International Journal Taegyu Lee, Hyun Oh Oh, Jeongil Seo, Young-Cheol Park, Dae Hee Youn "Scalable Multiband Binaural Renderer for MPEG-H 3D Audio" in IEEE Journal of Selected Topics in Signal Processing, vol.9, issue 5, pp.907-920, 2015
236 International Conference Heejin Ahn, Eunwoo Song, Won-Suk Jun, Hong-goo Kang "A Compression Algorithms for Hidden Markov Model-Based Speech Synthesis Systems" in ITC-CSCC, pp.942-945, 2015
235 International Conference JeeSok Lee, Sejin Oh, Hong-Goo Kang "Coherent channel based subband multichannel dereverberation" in ICASSP, pp.2704-2708, 2015
234 International Conference Eunwoo Song, Young-Sun Joo, Hong-Goo Kang "Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system" in ICASSP, 2015
233 Domestic Journal 박영철, 이태규, 윤대희 "MPEG-H 3D 오디오 바이노럴 렌더링 기술 표준화" in 대한전기학회, 전기의 세계, vol.64, 제 2호, pp.27-31, 2015
232 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue 3, pp.411-419, 2014
231 International Conference Eunwoo Song, Hong-Goo Kang, Joonil Lee "Fixed-point implementation of MPEG-D unified speech and audio coding decoder" in 19th International Conference on Digital Signal Processing (DSP), pp.110-113, 2014