Papers

Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo

International Conference
2011~2015
Posted by: 한혜원
Posted on: 2015-09-01 00:47
Authors : Eunwoo Song, Hong-Goo Kang

Year : 2015

Publisher / Conference : INTERSPEECH

This paper proposes a deep neural network (DNN)-based statistical parametric speech synthesis system using an improved time-frequency trajectory excitation (ITFTE) model. The ITFTE model, which efficiently reduces the parametric redundancy of a TFTE model, improved the perceptual quality of the vocoding process and the estimation accuracy of the training process. However, problems remain when ITFTE parameters are trained in a hidden Markov model (HMM) framework: cross-dimensional correlations between ITFTE parameters are represented inefficiently, statistical averaging over-smooths the outputs, and the decision tree-based state clustering paradigm over-fits the model. To alleviate these limitations, a centralized DNN replaces the decision trees of the HMM training process. An analysis of trainability confirms that the DNN training process improves model accuracy, which in turn improves the perceptual quality of the synthesized speech. Objective and subjective test results also verify that the proposed system performs better than the conventional HMM-based system.
Total: 372
94. International Conference: Haemin Yang, Soyeon Choe, Keulbit Kim, Hong-Goo Kang, "Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement," in ICSigSys, 2018
93. International Conference: Min-Jae Hwang, Eunwoo Song, Kyungguen Byun, Hong-Goo Kang, "Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System," in ICASSP, 2018
92. International Conference: Jinyoung Lee, Chahyeon Eom, Youngsu Kwak, Hong-Goo Kang, Chungyoung Lee, "DNN-based Wireless Positioning in An Outdoor Environment," in ICASSP, 2018
91. International Conference: Seung-chul Shin, Sangyeop Lee, Taeho Lee, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang, "Two electrode based healthcare device for continuously monitoring ECG and BIA signals," in BHI, 2018
90. International Conference: Eunwoo Song, Frank K. Soong, Hong-Goo Kang, "Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems," in ASRU, 2017
89. International Conference: Seung-chul Shin, Junhyung Moon, Saewon Kye, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang, "Continuous bladder volume monitoring system for wearable applications," in EMBC, 2017
88. International Conference: Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang, "Deep bi-directional long short-term memory based speech enhancement for wind noise reduction," in HSCMA, 2017
87. International Conference: JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang, "A study on search grid points for data-driven 3-D beamsteering," in HSCMA, 2017
86. International Conference: Young-Sun Joo, Won-Suk Jun, Hong-Goo Kang, "Efficient deep neural networks for speech synthesis using bottleneck features," in APSIPA, 2016
85. International Conference: Ji-ho Seo, Young-cheol Park, Dae Hee Youn, "Design of feedback active noise control system based on a constrained optimization for headphone/earphone applications," in IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia), 2016