Papers

Deep bi-directional long short-term memory based speech enhancement for wind noise reduction

International Conference
2016~2020
작성자
한혜원
작성일
2017-03-01 16:30
조회
1440
Authors : Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang

Year : 2017

Publisher / Conference : HSCMA

In this paper, we propose a new recurrent neural network (RNN)-based single-channel speech enhancement framework for off-line wind noise reduction. To adequately represent highly non-stationary characteristics of wind noise, we first adopt a deep bi-directional long short-term memory (DBLSTM) structure. However, its enhanced output becomes muffled due to the spectral over-smoothing effect. To overcome this problem, we propose a new structure of DBLSTM-based speech enhancement system that internally incorporates the speech and noise power estimation processes in the spectral filtering framework. Furthermore, we propose a structure with an additional internal constraint of minimizing log a priori SNR, which provides efficient learning mechanism. Experimental results show that the proposed method improves source-to-distortion ratio (SDR) by 6.9 dB and perceptual evaluation of speech quality (PESQ) by 0.24 in comparison to the conventional DBLSTM-based system.
전체 355
1 Domestic Conference 김정규, 박영철, 강홍구 "저사양 TV 사운드 설계환경을 위한 IIR 필터 기반 주파수 등화기" in 대한전자공학회 학술대회, 2017