Papers

A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system

International Conference
2016~2020
작성자
한혜원
작성일
2016-10-01 00:52
조회
3111
Authors : Jin-Seob Kim, Young-Sun Joo, Inseon Jang, ChungHyun Ahn, Jeongil Seo, Hong-Goo Kang

Year : 2016

Publisher / Conference : 21th International Conference on Digital Signal Processing (DSP)

This paper proposes a pitch-synchronous deep neural network (DNN)-based statistical parametric speech synthesis (SPSS) system. The pitch-synchronous frames defined by the locations of glottal closure instants (GCIs) are used to extract speech parameters, which significantly reduce coupling effects between vocal tract and excitation signals. As a result, the distribution of spectral parameters within the same context of phonetic classes becomes more uniform, which improves a model trainability especially for a small-scaled DNN framework. Although the effectiveness of pitch-synchronous approach has been proven in other applications, it is not trivial to integrate the method into the typical DNN-based SPSS systems that have regularized structures, i.e. fixed frame rate and fixed dimension of features. In this paper, we design a new DNN-based SPSS system that pitch-synchronously trains and generates speech parameters. Objective and subjective test results verify the superiority of the proposed system compared to the conventional approach.
전체 371
83 International Conference Jin-Seob Kim, Young-Sun Joo, Inseon Jang, ChungHyun Ahn, Jeongil Seo, Hong-Goo Kang "A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system" in 21th International Conference on Digital Signal Processing (DSP), 2016
82 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis" in INTERSPEECH, 2016
81 International Conference Eunwoo Song, Hong-Goo Kang "Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis" in EUSIPCO, 2016
80 International Conference Hyeongi Moon, Gyutae Park, Yeong-cheol Park, Dae Hee Youn "A Phase-Matched Exponential Harmonic Weighting for Improved Sensation of Virtual Bass" in 140th Convention of Audio Engineering Society, pp.9544, 2016
79 International Conference Il-eun Kwak, Hong-Goo Kang "Robust formant features for speaker verification in the lombard effect" in APSIPA, pp.114-118, 2015
78 International Conference Hyeonjoo Kang, JeeSok Lee, Soonho Baek, Hong-Goo Kang "Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems" in INTERSPEECH, 2015
77 International Conference Kyungguen Byun, Eunwoo Song, Hong-goo Kang "A constrained two-layer compression technique for ECG waves" in Enegineering in Medicine and Biology Society (EMBC), 2015
76 International Conference Eunwoo Song, Hong-Goo Kang "Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo" in INTERSPEECH, 2015
75 International Conference Heejin Ahn, Eunwoo Song, Won-Suk Jun, Hong-goo Kang "A Compression Algorithms for Hidden Markov Model-Based Speech Synthesis Systems" in ITC-CSCC, pp.942-945, 2015
74 International Conference JeeSok Lee, Sejin Oh, Hong-Goo Kang "Coherent channel based subband multichannel dereverberation" in ICASSP, pp.2704-2708, 2015