Papers

Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo

International Conference
2011~2015
작성자
한혜원
작성일
2015-09-01 00:47
조회
1262
Authors : Eunwoo Song, Hong-Goo Kang

Year : 2015

Publisher / Conference : INTERSPEECH

This paper proposes a deep neural network (DNN)-based statistical parametric speech synthesis system using an improved time-frequency trajectory excitation (ITFTE) model. The ITFTE model, which efficiently reduces the parametric redundancy of a TFTE model, improved the perceptual quality of the vocoding process and the estimation accuracy of the training process. However, there remain problems related to training ITFTE parameters in a hidden Markov model (HMM) framework, such as inefficiency of representing cross-dimensional correlations between ITFTE parameters, over-smoothed outputs caused by statistical averaging, and an over-fitted model due to a decision tree-based state clustering paradigm. To alleviate these limitations, a centralized DNN replaces the decision trees of the HMM training process. Analysis of trainability confirms that the DNN training process improves the model accuracy, which results in improved perceptual quality of synthesized speech. Objective and subjective test results also verify that the proposed system performs better than the conventional HMM-based system.
전체 355
92 International Conference Jinyoung Lee, Chahyeon Eom, Youngsu Kwak, Hong-Goo Kang, Chungyoung Lee "DNN-based Wireless Positioning in An Outdoor Environment" in ICASSP, 2018
91 International Conference Seung-chul Shin, Sangyeop Lee, Taeho Lee, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang "Two electrode based healthcare device for continuously monitoring ECG and BIA signals" in BHI, 2018
90 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems" in ASRU, 2017
89 International Conference Seung-chul Shin, Junhyung Moon, Saewon Kye, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang "Continuous bladder volume monitoring system for wearable applications" in EMBC, 2017
88 International Conference Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang "Deep bi-directional long short-term memory based speech enhancement for wind noise reduction" in HSCMA, 2017
87 International Conference JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "A study on search grid points for data-driven 3-D beamsteering" in HSCMA, 2017
86 International Conference Young-Sun Joo, Won-Suk Jun, Hong-Goo Kang "Efficient deep neural networks for speech synthesis using bottleneck features" in APSIPA, 2016
85 International Conference Ji-ho Seo, Young-cheol Park, Dae Hee Youn "Design of feedback active noise control system based on a constrained optimization for headphone/earphone applications" in IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia), 2016
84 International Conference Haemin Yang, Kyungguen Byun, Youngsu Kwak, Hong-Goo Kang "Parametric-based non-intrusive speech quality assessment by deep neural network" in 21th International Conference on Digital Signal Processing (DSP), 2016
83 International Conference Jin-Seob Kim, Young-Sun Joo, Inseon Jang, ChungHyun Ahn, Jeongil Seo, Hong-Goo Kang "A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system" in 21th International Conference on Digital Signal Processing (DSP), 2016