Papers

A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system

International Conference
2016~2020
작성자
한혜원
작성일
2016-10-01 00:52
조회
1695
Authors : Jin-Seob Kim, Young-Sun Joo, Inseon Jang, ChungHyun Ahn, Jeongil Seo, Hong-Goo Kang

Year : 2016

Publisher / Conference : 21th International Conference on Digital Signal Processing (DSP)

This paper proposes a pitch-synchronous deep neural network (DNN)-based statistical parametric speech synthesis (SPSS) system. The pitch-synchronous frames defined by the locations of glottal closure instants (GCIs) are used to extract speech parameters, which significantly reduce coupling effects between vocal tract and excitation signals. As a result, the distribution of spectral parameters within the same context of phonetic classes becomes more uniform, which improves a model trainability especially for a small-scaled DNN framework. Although the effectiveness of pitch-synchronous approach has been proven in other applications, it is not trivial to integrate the method into the typical DNN-based SPSS systems that have regularized structures, i.e. fixed frame rate and fixed dimension of features. In this paper, we design a new DNN-based SPSS system that pitch-synchronously trains and generates speech parameters. Objective and subjective test results verify the superiority of the proposed system compared to the conventional approach.
전체 364
88 International Conference Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang "Deep bi-directional long short-term memory based speech enhancement for wind noise reduction" in HSCMA, 2017
87 International Conference JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "A study on search grid points for data-driven 3-D beamsteering" in HSCMA, 2017
86 International Conference Young-Sun Joo, Won-Suk Jun, Hong-Goo Kang "Efficient deep neural networks for speech synthesis using bottleneck features" in APSIPA, 2016
85 International Conference Ji-ho Seo, Young-cheol Park, Dae Hee Youn "Design of feedback active noise control system based on a constrained optimization for headphone/earphone applications" in IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia), 2016
84 International Conference Haemin Yang, Kyungguen Byun, Youngsu Kwak, Hong-Goo Kang "Parametric-based non-intrusive speech quality assessment by deep neural network" in 21th International Conference on Digital Signal Processing (DSP), 2016
83 International Conference Jin-Seob Kim, Young-Sun Joo, Inseon Jang, ChungHyun Ahn, Jeongil Seo, Hong-Goo Kang "A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system" in 21th International Conference on Digital Signal Processing (DSP), 2016
82 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis" in INTERSPEECH, 2016
81 International Conference Eunwoo Song, Hong-Goo Kang "Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis" in EUSIPCO, 2016
80 International Conference Hyeongi Moon, Gyutae Park, Yeong-cheol Park, Dae Hee Youn "A Phase-Matched Exponential Harmonic Weighting for Improved Sensation of Virtual Bass" in 140th Convention of Audio Engineering Society, pp.9544, 2016
79 International Conference Il-eun Kwak, Hong-Goo Kang "Robust formant features for speaker verification in the lombard effect" in APSIPA, pp.114-118, 2015