Papers

Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system

International Conference
2011~2015
Posted by : 한혜원
Posted on : 2015-04-01 00:45
Authors : Eunwoo Song, Young-Sun Joo, Hong-Goo Kang

Year : 2015

Publisher / Conference : ICASSP

This paper proposes an improved time-frequency trajectory excitation (TFTE) modeling method for a statistical parametric speech synthesis system. The proposed approach overcomes the dimensional variation problem of the training process caused by the inherent nature of the pitch-dependent analysis paradigm. By reducing the redundancies of the parameters using predicted average block coefficients (PABC), the proposed algorithm efficiently models excitation, even if its dimension is varied. Objective and subjective test results verify that the proposed algorithm provides not only robustness to the training process but also naturalness to the synthesized speech.
Total : 360
77 International Conference Kyungguen Byun, Eunwoo Song, Hong-Goo Kang "A constrained two-layer compression technique for ECG waves" in Engineering in Medicine and Biology Society (EMBC), 2015
76 International Conference Eunwoo Song, Hong-Goo Kang "Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo" in INTERSPEECH, 2015
75 International Conference Heejin Ahn, Eunwoo Song, Won-Suk Jun, Hong-Goo Kang "A Compression Algorithms for Hidden Markov Model-Based Speech Synthesis Systems" in ITC-CSCC, pp.942-945, 2015
74 International Conference JeeSok Lee, Sejin Oh, Hong-Goo Kang "Coherent channel based subband multichannel dereverberation" in ICASSP, pp.2704-2708, 2015
73 International Conference Eunwoo Song, Young-Sun Joo, Hong-Goo Kang "Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system" in ICASSP, 2015
72 International Conference Eunwoo Song, Hong-Goo Kang, Joonil Lee "Fixed-point implementation of MPEG-D unified speech and audio coding decoder" in 19th International Conference on Digital Signal Processing (DSP), pp.110-113, 2014
71 International Conference Soonho Baek, Hong-Goo Kang "Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment" in ICASSP, 2014
70 International Conference Ho Seon Shin, Hong-Goo Kang "Bone-Conduction Speech Enhancement using a Speaker-Independent Filter" in ICEIC, 2014
69 International Conference Soonho Baek, Hong-Goo Kang "Vector Taylor Series based HMM Adaptation for Generalized Cepstrum in Noisy Environment" in ASRU, 2013
68 International Conference Jung-Won Lee, Hong-Goo Kang, Samuel Kim, Yoonjae Lee "Detecting pathological speech using local and global characteristics of harmonic-to-noise ratio" in APSIPA, 2013