Papers

An Efficient Segment-Based Speech Compression Technique for Hand-Held TTS Systems

International Conference
2006~2010
작성자
한혜원
작성일
2006-09-17 22:40
조회
1006
Authors : Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang

Year : 2006

Publisher / Conference : INTERSPEECH

This paper proposes a novel segment-based speech coding algorithm to efficiently compress the database for concatenative text-to-speech (TTS) systems. To achieve a high compression ratio and meet the fundamental requirements of concatenative TTS synthesizers, i.e. partial segment decoding and random access capability, we adopt a modified analysis-by-synthesis scheme. The spectral coefficients are quantized by a length-based interpolation method and excitation signals are modeled with both non-predictive and predictive approaches. Considering that pitch pulse waveforms of a specific speaker show low intra-variation, the conventional adaptive codebook for pitch prediction is replaced by a speaker dependent pitch-pulse codebook. By applying the proposed algorithm to a hand-held Korean TTS system, we verify that the proposed coder provides a compression ratio of about 1/13, a low complexity of around 1.2 WMOPS, and random access capability.
전체 355
42 International Conference Jae-Mo Yang, Chang-Heon Lee, Hong-Goo Kang "A robust time difference of arrival estimator in reverberant environments" in EUSIPCO, 2009
41 International Conference Ho Seon Shin, Hong-Goo Kang, Min-Seok Choi, Taesu Kim "SPEECH REINFORCEMENT BASED ON BINAURAL LOUDNESS MODEL" in The Fourth Beijing-Hong Kong International Doctoral Forum, 2009
40 International Conference Chi-Sang Jung, Moo-Young Kim, Hong-Goo Kang "Normalized minimum-redundancy and maximum-relevancy based feature selection for speaker verification systems" in ICASSP, pp.4549-4552, 2009
39 International Conference Dongil-Hyun,Tacksung Choi, Daehee Youn, Seokpil Lee, Youngcheol Park "The Use of Delay Control for Stereophonic Audio Rendering Based on VBAP" in 125th Convention of Audio Engineering Society, pp.7603, 2008
38 International Conference Gun-Woo Lee, Jae-sung Lee, Young-Cheol Park, Dae Hee Youn "Quality Improvement of Very Low Bit Rate HE-AAC Using Linear Prediction Module" in 125th Convention of Audio Engineering Society, pp.7624, 2008
37 International Conference Myung-Suk Song, SoonH Beak,Seok-Pil Lee, Hong-Goo Kang "Constrained-Optimized Sound Beamforming of Loudspeaker-Array System" in 125th Convention of Audio Engineering Society, pp.7641, 2008
36 International Conference Yang-Won Jung, Hyen-O Oh "Personalized Music Service Based on Parametric Object Oriented Spatial Audio Coding" in AES 34th International Conference, 2008
35 International Conference Myung-Suk Song, Hyun-Woo Kang, Hong-Goo Kang "Discrimination of Music Signals for Mobile Broadcasting Receivers" in AES 34th International Conference, 2008
34 International Conference Junho Lee, Young-Cheol Park, Dae Hee Youn "Robust Crosstalk Cancellation Based on Energy-Based Control" in AES 34th International Conference, 2008
33 International Conference Jae-Mo Yang, Min-seok Choi, Hong-Goo Kang "Two-channel DOA estimation usign frequency selective music algorithm with a phase compensation in reverberant room" in 2008 5th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2008