Papers
Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization
International Conference
2016~2020
작성자
한혜원
작성일
2019-06-01 16:47
조회
2884
We first investigate whether the quality of enhanced output varies depending on the ratio of the size of speech models to that of noise models. We especially show that the optimal ratio is related to the signal-to-noise ratio (SNR) of the input signal. Based on the results of analysis, we propose an efficient algorithm for adaptively changing the size of speech and noise NMF models in each analysis frame. Since the proposed algorithm takes into account the trade- off relationship between speech distortion and noise reduction, its output quality becomes very natural. The experimental results also confirm the superiority of the proposed algorithm to conventional template matching based algorithms.
전체 370
102 | International Conference | Min-Jae Hwang, Hong-Goo Kang "Parameter enhancement for MELP speech codec in noisy communication environment" in INTERSPEECH, 2019 | |
101 | International Conference | Keulbit Kim, Jinkyu Lee, Jan Skoglund, Hong-Goo Kang "Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization" in ITC-CSCC, 2019 | |
100 | International Conference | Ohsung Kwon, Inseon Jang, ChungHyun Ahn, Hong-Goo Kang "Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework" in ITC-CSCC, 2019 | |
99 | International Conference | Kyungguen Byun, Eunwoo Song, Jinseob Kim, Jae-Min Kim, Hong-Goo Kang "Excitation-by-SampleRNN Model for Text-to-Speech" in ITC-CSCC, 2019 | |
98 | International Conference | Yang Yuan, Soo-Whan Chung, Hong-Goo Kang "Gradient-based active learning query strategy for end-to-end speech recognition" in ICASSP, 2019 | |
97 | International Conference | Soo-Whan Chung, Joon Son Chung, Hong-Goo Kang "Perfect match: Improved cross-modal embeddings for audio-visual synchronisation" in ICASSP, 2019 | |
96 | International Conference | Hyewon Han, Kyungguen Byun, Hong-Goo Kang "A Deep Learning-based Stress Detection Algorithm with Speech Signal" in Workshop on Audio-Visual Scene Understanding for Immersive Multimedia (AVSU’18), 2018 | |
95 | International Conference | Min-Jae Hwang, Eunwoo Song, Jin-Seob Kim, Hong-Goo Kang "A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems" in INTERSPEECH, 2018 | |
94 | International Conference | Haemin Yang, Soyeon Choe, Keulbit Kim, Hong-Goo Kang "Deep learning-based speech presence probability estimation for noise PSD estimation in single-channel speech enhancement" in ICSigSys, 2018 | |
93 | International Conference | Min-Jae Hwang, Eunwoo Song, Kyungguen Byun, Hong-Goo Kang "Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System" in ICASSP, 2018 | |