Papers

Parametric-based non-intrusive speech quality assessment by deep neural network

International Conference
2016~2020
작성자
한혜원
작성일
2016-10-01 00:53
조회
263
Authors : Haemin Yang, Kyungguen Byun, Youngsu Kwak, Hong-Goo Kang

Year : 2016

Publisher / Conference : 21th International Conference on Digital Signal Processing (DSP)

This paper proposes a deep neural network (DNN) based non-intrusive speech quality estimation method in real-time voice communication systems. Since the proposed method only utilizes real-time control protocol (RTCP) information in the receiver side and does not need a reference signal, it is possible to continuously monitor the quality of service (QoS). Unlike the conventional non-intrusive E-model system that predicts QoS by utilizing delay, jitter, and type of codec with a rule-based method, the proposed method actively estimates the non-linear relationship between multi-dimensional parameters of RTCP and subjectively motivated reference scores using a DNN structure. In order to select efficient features, the relationship between each parameter of RTCP and perceptual objective listening quality assessment (POLQA) is thoroughly investigated, then we train the DNN model by changing the number of layers and nodes. The proposed algorithm achieved 0.8693 correlation with 21,206 reference POLQA scores that are sampled from real environment.
전체 322
322 International Conference Doyeon Kim, Hyewon Han, Hyeon-Kyeong Shin, Soo-Whan Chung, Hong-Goo Kang "Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement" in ICASSP, 2022
321 International Conference Chanwoo Lee, Hyungseob Lim, Jihyun Lee, Inseon Jang, Hong-Goo Kang "Progressive Multi-Stage Neural Audio Coding with Guided References" in ICASSP, 2022
320 International Conference Jihyun Lee, Hyungseob Lim, Chanwoo Lee, Inseon Jang, Hong-Goo Kang "Adversarial Audio Synthesis Using a Harmonic-Percussive Discriminator" in ICASSP, 2022
319 International Conference Jinyoung Lee and Hong-Goo Kang "Stacked U-Net with High-level Feature Transfer for Parameter Efficient Speech Enhancement" in APSIPA ASC, 2021
318 International Conference Huu-Kim Nguyen, Kihyuk Jeong, Se-Yun Um, Min-Jae Hwang, Eunwoo Song, Hong-Goo Kang "LiteTTS: A Decoder-free Light-weight Text-to-wave Synthesis Based on Generative Adversarial Networks" in INTERSPEECH, 2021
317 International Conference Zainab Alhakeem, Yoohwan Kwon, Hong-Goo Kang "Disentangled Representations for Arabic Dialect Identification based on Supervised Clustering with Triplet Loss" in EUSIPCO, 2021
316 International Conference Miseul Kim, Minh-Tri Ho, Hong-Goo Kang "Self-supervised Complex Network for Machine Sound Anomaly Detection" in EUSIPCO, 2021
315 International Conference Kihyuk Jeong, Huu-Kim Nguyen, Hong-Goo Kang "A Fast and Lightweight Text-To-Speech Model with Spectrum and Waveform Alignment Algorithms" in EUSIPCO, 2021
314 International Conference Jiyoung Lee*, Soo-Whan Chung*, Sunok Kim, Hong-Goo Kang**, Kwanghoon Sohn** "Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation" in CVPR, 2021
313 International Conference Zainab Alhakeem, Hong-Goo Kang "Confidence Learning from Noisy Labels for Arabic Dialect Identification" in ITC-CSCC, 2021