Papers

Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement

International Conference
2016~2020
작성자
한혜원
작성일
2018-05-01 16:35
조회
376
Authors : Haemin Yang, Soyeon Choe, Keulbit Kim, Hong-Goo Kang

Year : 2018

Publisher / Conference : ICSigSys

In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a function of noise power spectral density (PSD) in time frequency domain, and the state-of-the-art algorithm also introduces additional processes to estimate speech presence probability (SPP) to further enhance the estimation. Due to many tuning parameters, however, it is not easy to implement an algorithm that reliably estimates SPP in noise varying environment. We proposed a combination of deep learning network and an effective training method to enhance the performance of the SPP estimation module. The proposed approach is regarded as a hybrid approach, with the noise reduction factor still estimated by the conventional statistic-based single channel enhancement algorithms. The advantages and disadvantages of the proposed approach compared to deep learning approach of single channel speech enhancement are also investigated.
전체 327
277 Domestic Conference 최소연, 정수환, 강홍구 "임베딩 매트릭스를 기반으로 한 비정상적 잡음 제거 알고리즘의 분석과 딥러닝 음질개선 방법들과의 성능비교" in 한국음향학회 추계발표대회, 2018
276 International Journal Jinkyu Lee, Jan Skoglund, Turaj Shabestary, Hong-Goo Kang "Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement" in IEEE Signal Processing Letters, vol.25, issue 8, pp.1276-1280, 2018
275 International Conference Haemin Yang, Soyeon Choe, Keulbit Kim, Hong-Goo Kang "Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement" in ICSigSys, 2018
274 International Conference Min-Jae Hwang, Eunwoo Song, Kyungguen Byun, Hong-Goo Kang "Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System" in ICASSP, 2018
273 International Conference Jinyoung Lee, Chahyeon Eom, Youngsu Kwak, Hong-Goo Kang, Chungyoung Lee "DNN-based Wireless Positioning in An Outdoor Environment" in ICASSP, 2018
272 International Conference Seung-chul Shin, Sangyeop Lee, Taeho Lee, Kyoungwoo Lee, Yong Seung Lee, Hong-Goo Kang "Two electrode based healthcare device for continuously monitoring ECG and BIA signals" in BHI, 2018
271 International Journal JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "Generic uniform search grid generation algorithm for far-field source localization" in The Journal of the Acoustical Society of America, vol.143, 2018
270 International Journal Min-Jae Hwang, JeeSok Lee, MiSuk Lee, Hong-Goo Kang "SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals" in IEEE Transactions on Multimedia, vol.20, issue 1, pp.45-54, 2018
269 International Conference Eunwoo Song, Frank K. Soong, Hong-Goo Kang "Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems" in ASRU, 2017
268 Domestic Conference 양해민, 강홍구 "잡음 예측을 위한 심층 신경망기반 음성 존재 확률 계산법" in 대한전자공학회 추계학술대회, 2017