Papers

Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo

International Conference
2011~2015
작성자
한혜원
작성일
2015-09-01 00:47
조회
1421
Authors : Eunwoo Song, Hong-Goo Kang

Year : 2015

Publisher / Conference : INTERSPEECH

This paper proposes a deep neural network (DNN)-based statistical parametric speech synthesis system using an improved time-frequency trajectory excitation (ITFTE) model. The ITFTE model, which efficiently reduces the parametric redundancy of a TFTE model, improved the perceptual quality of the vocoding process and the estimation accuracy of the training process. However, there remain problems related to training ITFTE parameters in a hidden Markov model (HMM) framework, such as inefficiency of representing cross-dimensional correlations between ITFTE parameters, over-smoothed outputs caused by statistical averaging, and an over-fitted model due to a decision tree-based state clustering paradigm. To alleviate these limitations, a centralized DNN replaces the decision trees of the HMM training process. Analysis of trainability confirms that the DNN training process improves the model accuracy, which results in improved perceptual quality of synthesized speech. Objective and subjective test results also verify that the proposed system performs better than the conventional HMM-based system.
전체 355
53 Domestic Journal Ji-ho Seo, Dae Hee Youn, Young-Cheol Park "A Method of Designing Low-power Feedback Active Noise Control Filter for Headphones/Earphones" in 한국통신학회논문지, vol.10, 제 1호, pp.57-65, 2017
52 International Conference Ji-ho Seo, Young-cheol Park, Dae Hee Youn "Design of feedback active noise control system based on a constrained optimization for headphone/earphone applications" in IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia), 2016
51 International Conference Hyeongi Moon, Gyutae Park, Yeong-cheol Park, Dae Hee Youn "A Phase-Matched Exponential Harmonic Weighting for Improved Sensation of Virtual Bass" in 140th Convention of Audio Engineering Society, pp.9544, 2016
50 International Journal Taegyu Lee, Hyun Oh Oh, Jeongil Seo, Young-Cheol Park, Dae Hee Youn "Scalable Multiband Binaural Renderer for MPEG-H 3D Audio" in IEEE Journal of Selected Topics in Signal Processing, vol.9, issue 5, pp.907-920, 2015
49 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue 3, pp.411-419, 2014
48 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue.3, pp.411-419, 2014
47 International Journal Seong-woo Kim, Young-Cheol Park, Dae Hee Youn "A variable step-size gradient adaptive lattice algorithm for multiple sinusoidal interference cancelation" in EURASIP Journal on Advances in Signal Processing, vol.106, 2013
46 International Conference Taegyu Lee, Seokjin Lee, Young-cheol Park, Dae Hee Youn "Virtual bass system based on a multiband harmonic generation" in ICCE, 2013
45 International Conference Se-Woon Jeon, Dae Hee Youn, Young-Cheol Park "Blind depth estimation based on primary-to-ambient energy ratio for 3-D acoustic depth rendering" in APSIPA ASC, 2012
44 International Journal Dong-il Hyun, Young-Cheol Park, Dae Hee Youn "Estimation and quantization of ICC-dependent phase parameters for parametric stereo audio coding" in EURASIP Journal on Audio, Speech, and Music Processing, vol.27, 2012