Papers

Deep bi-directional long short-term memory based speech enhancement for wind noise reduction

International Conference

2016~2020

작성자

한혜원

작성일

2017-03-01 16:30

조회

1440

Authors : Jinkyu Lee, Keulbit Kim, Turaj Shabestary, Hong-Goo Kang

Year : 2017

Publisher / Conference : HSCMA

In this paper, we propose a new recurrent neural network (RNN)-based single-channel speech enhancement framework for off-line wind noise reduction. To adequately represent highly non-stationary characteristics of wind noise, we first adopt a deep bi-directional long short-term memory (DBLSTM) structure. However, its enhanced output becomes muffled due to the spectral over-smoothing effect. To overcome this problem, we propose a new structure of DBLSTM-based speech enhancement system that internally incorporates the speech and noise power estimation processes in the spectral filtering framework. Furthermore, we propose a structure with an additional internal constraint of minimizing log a priori SNR, which provides efficient learning mechanism. Experimental results show that the proposed method improves source-to-distortion ratio (SDR) by 6.9 dB and perceptual evaluation of speech quality (PESQ) by 0.24 in comparison to the conventional DBLSTM-based system.

« A study on search grid points for data-driven 3-D beamsteering

저사양 TV 사운드 설계환경을 위한 IIR 필터 기반 주파수 등화기 »

목록보기

전체 355

Domestic Conference

김정규, 박영철, 강홍구 "저사양 TV 사운드 설계환경을 위한 IIR 필터 기반 주파수 등화기" in 대한전자공학회 학술대회, 2017

Deep bi-directional long short-term memory based speech enhancement for wind noise reduction

Previous

Sister Lab.

Yonsei University

Academic Website