Papers

Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement

International Conference
2016~2020
작성자
한혜원
작성일
2018-05-01 16:35
조회
2323
Authors : Haemin Yang, Soyeon Choe, Keulbit Kim, Hong-Goo Kang

Year : 2018

Publisher / Conference : ICSigSys

In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a function of noise power spectral density (PSD) in time frequency domain, and the state-of-the-art algorithm also introduces additional processes to estimate speech presence probability (SPP) to further enhance the estimation. Due to many tuning parameters, however, it is not easy to implement an algorithm that reliably estimates SPP in noise varying environment. We proposed a combination of deep learning network and an effective training method to enhance the performance of the SPP estimation module. The proposed approach is regarded as a hybrid approach, with the noise reduction factor still estimated by the conventional statistic-based single channel enhancement algorithms. The advantages and disadvantages of the proposed approach compared to deep learning approach of single channel speech enhancement are also investigated.
전체 364
27 International Conference Minh-Tri Ho, Jinyoung Lee, Bong-Ki Lee, Dong Hoon Yi, Hong-Goo Kang "A Cross-channel Attention-based Wave-U-Net for Multi-channel Speech Enhancement" in INTERSPEECH, 2020
26 International Conference Seyun Um, Sangshin Oh, Kyungguen Byun, Inseon Jang, ChungHyun Ahn, Hong-Goo Kang "Emotional Speech Synthesis with Rich and Granularized Control" in ICASSP, 2020
25 International Conference Min-Jae Hwang, Eunwoo Song, Ryuichi Yamamoto, Frank Soong, Hong-Goo Kang "Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network" in ICASSP, 2020
24 International Conference Hyeonjoo Kang, Young-Sun Joo, Inseon Jang, Chunghyun Ahn, Hong-Goo Kang "A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis" in APSIPA, 2019
23 International Conference Min-Jae Hwang, Hong-Goo Kang "Parameter enhancement for MELP speech codec in noisy communication environment" in INTERSPEECH, 2019
22 International Conference Keulbit Kim, Jinkyu Lee, Jan Skoglund, Hong-Goo Kang "Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization" in ITC-CSCC, 2019
21 International Conference Ohsung Kwon, Inseon Jang, ChungHyun Ahn, Hong-Goo Kang "Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework" in ITC-CSCC, 2019
20 International Conference Kyungguen Byun, Eunwoo Song, Jinseob Kim, Jae-Min Kim, Hong-Goo Kang "Excitation-by-SampleRNN Model for Text-to-Speech" in ITC-CSCC, 2019
19 International Conference Yang Yuan, Soo-Whan Chung, Hong-Goo Kang "Gradient-based active learning query strategy for end-to-end speech recognition" in ICASSP, 2019
18 International Conference Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang "Perfect match: Improved cross-modal embeddings for audio-visual synchronisation" in ICASSP, 2019