Papers

Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System

International Conference
2016~2020
작성자
한혜원
작성일
2018-04-01 16:34
조회
1430
Authors : Min-Jae Hwang, Eunwoo Song, Kyungguen Byun, Hong-Goo Kang

Year : 2018

Publisher / Conference : ICASSP

This paper proposes a novel noise compensation algorithm for a glottal excitation model in a deep learning (DL)-based speech synthesis system. To generate high-quality speech synthesis outputs, the balance between harmonic and noise components of the glottal excitation signal should be well-represented by the DL network. However, it is hard to accurately model the noise component because the DL training process inevitably results in statistically smoothed outputs; thus, it is essential to introduce an additional noise compensation process. We propose a modeling-by-generation structure-based noise compensation method that the missing noise component in the generated glottal signal is directly extracted and parameterized during the entire training process. By modeling the noise component using the additional DL network, the proposed system successfully compensates the missing noise component. Objective and subjective test results confirm that the synthesized speech with the proposed noise compensation method is superior to that with conventional methods.
전체 355
275 International Conference Haemin Yang, Soyeon Choe, Keulbit Kim, Hong-Goo Kang "Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement" in ICSigSys, 2018
274 International Conference Min-Jae Hwang, Eunwoo Song, Kyungguen Byun, Hong-Goo Kang "Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System" in ICASSP, 2018
273 International Conference Jinyoung Lee, Chahyeon Eom, Youngsu Kwak, Hong-Goo Kang, Chungyoung Lee "DNN-based Wireless Positioning in An Outdoor Environment" in ICASSP, 2018
272 International Conference Seung-chul Shin, Sangyeop Lee, Taeho Lee, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang "Two electrode based healthcare device for continuously monitoring ECG and BIA signals" in BHI, 2018
271 International Journal JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "Generic uniform search grid generation algorithm for far-field source localization" in The Journal of the Acoustical Society of America, vol.143, 2018
270 International Journal Min-Jae Hwang, JeeSok Lee, MiSuk Lee, Hong-Goo Kang "SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals" in IEEE Transactions on Multimedia, vol.20, issue 1, pp.45-54, 2018
269 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems" in ASRU, 2017
268 Domestic Conference 양해민, 강홍구 "잡음 예측을 위한 심층 신경망기반 음성 존재 확률 계산법" in 대한전자공학회 추계학술대회, 2017
267 Domestic Conference 오상신, 정수환, 강홍구 "음성 인식 기반의 방송미디어 디바이스 제어 및 편집 시스템 구현" in 대한전자공학회 추계학술대회, 2017
266 International Journal Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems" in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.25, issue 11, pp.2152-2161, 2017