Source-Filter based Full-band Adaptive Harmonic Model and Its Application to Speech Prosody Modification

International Conference
2013-08-25 00:21
Authors : JeeSok Lee, Frank Soong, Hong-Goo Kang

Year : 2013

Publisher / Conference : INTERSPEECH

This paper presents a source-filter based adaptive harmonic model (aHM) that can modify prosody of given speech signals. Although the conventional aHM generates a homogeneous replication of the input speech, it is not suitable for prosody modification since temporal and spectral information are interweaved. The proposed method overcomes such limitation by further decomposing the harmonic parameter extracted from aHM into source and filter related components. By applying source-filter structure to aHM, the proposed algorithm can modify pitch of the synthesized speech with introducing only minor degradation. Both objective and subjective test results show that the proposed algorithm can naturally manipulate pitch contour, of which performance is much better than conventional algorithms such as pitch synchronous overlap add (PSOLA) and speech transformation and representation using adaptive interpolation of weighted spectrum (STRAIGHT). Index Terms: Prosody modification, pitch modification, speech analysis, speech synthesis, harmonic model.
전체 344
84 International Conference Haemin Yang, Kyungguen Byun, Youngsu Kwak, Hong-Goo Kang "Parametric-based non-intrusive speech quality assessment by deep neural network" in 21th International Conference on Digital Signal Processing (DSP), 2016
83 International Conference Jin-Seob Kim, Young-Sun Joo, Inseon Jang, ChungHyun Ahn, Jeongil Seo, Hong-Goo Kang "A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system" in 21th International Conference on Digital Signal Processing (DSP), 2016
82 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis" in INTERSPEECH, 2016
81 International Conference Eunwoo Song, Hong-Goo Kang "Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis" in EUSIPCO, 2016
80 International Conference Hyeongi Moon, Gyutae Park, Yeong-cheol Park, Dae Hee Youn "A Phase-Matched Exponential Harmonic Weighting for Improved Sensation of Virtual Bass" in 140th Convention of Audio Engineering Society, pp.9544, 2016
79 International Conference Il-eun Kwak, Hong-Goo Kang "Robust formant features for speaker verification in the lombard effect" in APSIPA, pp.114-118, 2015
78 International Conference Hyeonjoo Kang, JeeSok Lee, Soonho Baek, Hong-Goo Kang "Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems" in INTERSPEECH, 2015
77 International Conference Kyungguen Byun, Eunwoo Song, Hong-goo Kang "A constrained two-layer compression technique for ECG waves" in Enegineering in Medicine and Biology Society (EMBC), 2015
76 International Conference Eunwoo Song, Hong-Goo Kang "Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo" in INTERSPEECH, 2015
75 International Conference Heejin Ahn, Eunwoo Song, Won-Suk Jun, Hong-goo Kang "A Compression Algorithms for Hidden Markov Model-Based Speech Synthesis Systems" in ITC-CSCC, pp.942-945, 2015