Papers

Deep bi-directional long short-term memory based speech enhancement for wind noise reduction

International Conference
2016~2020
작성자
한혜원
작성일
2017-03-01 16:30
조회
1470
Authors : Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang

Year : 2017

Publisher / Conference : HSCMA

In this paper, we propose a new recurrent neural network (RNN)-based single-channel speech enhancement framework for off-line wind noise reduction. To adequately represent highly non-stationary characteristics of wind noise, we first adopt a deep bi-directional long short-term memory (DBLSTM) structure. However, its enhanced output becomes muffled due to the spectral over-smoothing effect. To overcome this problem, we propose a new structure of DBLSTM-based speech enhancement system that internally incorporates the speech and noise power estimation processes in the spectral filtering framework. Furthermore, we propose a structure with an additional internal constraint of minimizing log a priori SNR, which provides efficient learning mechanism. Experimental results show that the proposed method improves source-to-distortion ratio (SDR) by 6.9 dB and perceptual evaluation of speech quality (PESQ) by 0.24 in comparison to the conventional DBLSTM-based system.
전체 355
7 International Journal JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "Generic uniform search grid generation algorithm for far-field source localization" in The Journal of the Acoustical Society of America, vol.143, 2018
6 International Journal Min-Jae Hwang, JeeSok Lee, MiSuk Lee, Hong-Goo Kang "SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals" in IEEE Transactions on Multimedia, vol.20, issue 1, pp.45-54, 2018
5 International Conference JeeSok Lee, Soo-Whan Chung, Min-Seok Choi, Hong-Goo Kang "A study on search grid points for data-driven 3-D beamsteering" in HSCMA, 2017
4 Domestic Conference Min-jae Hwang, JeeSok Lee, Misuk Lee, and Hong-Goo Kang "사전 분석법을 통한 스프레드 스펙트럼 기반 오디오 워터마킹 알고리즘의 성능 향상" in 한국음향학회 제 33회 음성통신 및 신호처리 학술대회, 2016
3 International Conference Hyeonjoo Kang, JeeSok Lee, Soonho Baek, Hong-Goo Kang "Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems" in INTERSPEECH, 2015
2 International Conference JeeSok Lee, Sejin Oh, Hong-Goo Kang "Coherent channel based subband multichannel dereverberation" in ICASSP, pp.2704-2708, 2015
1 International Conference JeeSok Lee, Frank Soong, Hong-Goo Kang "Source-Filter based Full-band Adaptive Harmonic Model and Its Application to Speech Prosody Modification" in INTERSPEECH, 2013