Papers

An Efficient Segment-Based Speech Compression Technique for Hand-Held TTS Systems

International Conference
2006~2010
작성자
한혜원
작성일
2006-09-17 22:40
조회
1054
Authors : Chang-Heon Lee, Sung-Kyo Jung, Thomas Eriksson, Won-Suk Jun, Hong-Goo Kang

Year : 2006

Publisher / Conference : INTERSPEECH

This paper proposes a novel segment-based speech coding algorithm to efficiently compress the database for concatenative text-to-speech (TTS) systems. To achieve a high compression ratio and meet the fundamental requirements of concatenative TTS synthesizers, i.e. partial segment decoding and random access capability, we adopt a modified analysis-by-synthesis scheme. The spectral coefficients are quantized by a length-based interpolation method and excitation signals are modeled with both non-predictive and predictive approaches. Considering that pitch pulse waveforms of a specific speaker show low intra-variation, the conventional adaptive codebook for pitch prediction is replaced by a speaker dependent pitch-pulse codebook. By applying the proposed algorithm to a hand-held Korean TTS system, we verify that the proposed coder provides a compression ratio of about 1/13, a low complexity of around 1.2 WMOPS, and random access capability.
전체 355
32 International Conference Myung-Suk Song, Seok-Pil Lee, Hong-Goo Kang "Simulation and Measurement of Sound Beam Forming by Speaker-Array in Room Reverberation Environment" in ICEIC, pp.1050-1054, 2008
31 International Conference Min-seok Choi, Hong-Goo Kang "A Two-Channel Minimum Mean-Square Error Log-Spectral Amplitude Estimator for Speech Enhancement" in HSCMA, 2008
30 International Conference Yoomi Hur, Young-Choel Park, Seok-Pil LEE, Dae Hee Youn "Efficient Individualization of HRTF Using Critical-Band Based Spectral Cues Control" in 124th Convention of Audio Engineering Society, pp.7447, 2008
29 International Conference Jae-woong Jeong, Young-Choel Park, Seok-Pil Lee, Dae Hee Youn "Multi-Channel Dereverberation System Using Modified Correlation-Based Blind Deconvolution and Multi-Microphone Spectral Subtrac" in 124th Convention of Audio Engineering Society, pp.7402, 2008
28 International Conference Sang-Wook Shin, Chang-Heon Lee, Hyen-O Oh, Hong-Goo Kang "Designing a unified speech/audio codec by adopting a single channel harmonic source separation module" in ICASSP, 2008
27 International Conference Jeng-Geun Kim, Dong-il Hyun, Dae Hee Youn, Young-Cheol Park "Quality Improvement Using a Sinusoidal Model in HE-AAC" in 123th Convention of Audio Engineering Society, pp.7292, 2007
26 International Conference Min-Ki Lee, Kyung-Tae Kim, Hong-Goo Kang, Dae Hee Youn "Speech Quality Estimation Using Packet Loss Effects in CELP-Type Speech Coders" in INTERSPEECH, pp.1697-1700, 2007
25 International Conference Chang-Heon Lee, Hong-Goo Kang "Improvement of Artificial Onset Reconstruction in VMR-WB Standard under Packet-loss Environments" in ITC-CSCC, 2007
24 International Conference Sun-kuk Moon, Tack-sung Choi, Young-Cheol Park, Dae Hee Youn "An Efficient Feature Selection Algorithm Based on Kullback-Leibler Divergence for Music Information Retrieval" in ITC-CSCC, pp.524-525, 2007
23 International Conference Min-Ki Lee, Sung-Wan Youn, Kyung-Tae Kim, Hong-Goo Kang "Speech Quality Degradation in Packet Loss Environment at Specific Speech Class" in ITC-CSCC, pp.781-782, 2007