Papers

Fixed-point implementation of MPEG-D unified speech and audio coding decoder

International Conference

2011~2015

작성자

한혜원

작성일

2014-08-01 00:43

조회

3725

Authors : Eunwoo Song, Hong-Goo Kang, Joonil Lee

Year : 2014

Publisher / Conference : 19th International Conference on Digital Signal Processing (DSP)

Page : 110-113

This paper describes a fixed-point implementation method of the unified speech and audio coding (USAC) decoder that has been recently standardized by moving picture experts group (MPEG). Since the structure of USAC is too complicated to support both speech and audio signals, the quality and complexity issues must be carefully reviewed while performing fixed-point implementation. By analyzing the structure of the USAC decoder, this paper describes key ideas to successfully realize the fixedpoint system. Subjective and objective test results verify that the implemented fixed-point decoder shows equivalent quality to the floating-point decoder. The average and worst cases of complexity depending on the type of encoding modes are also given in detail.

« Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment

Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system »

목록보기

전체 372

114	International Conference	Suhyeon Oh, Hyungseob Lim, Kyungguen Byun, Min-Jae Hwang, Eunwoo Song, Hong-Goo Kang "ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis" in APSIPA (*awarded Best Paper), 2020
113	International Conference	Hyeon-Kyeong Shin, Hyewon Han, Kyungguen Byun, Hong-Goo Kang "Speaker-invariant Psychological Stress Detection Using Attention-based Network" in APSIPA, 2020
112	International Conference	Min-Jae Hwang, Frank Soong, Eunwoo Song, Xi Wang, Hyeonjoo Kang, Hong-Goo Kang "LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis" in APSIPA, 2020
111	International Conference	Hyungseob Lim, Suhyeon Oh, Kyungguen Byun, Hong-Goo Kang "A Study on Conditional Features for a Flow-based Neural Vocoder" in Asilomar Conference on Signals, Systems, and Computers, 2020
110	International Conference	Soo-Whan Chung, Soyeon Choe, Joon Son Chung, Hong-Goo Kang "FaceFilter: Audio-visual speech separation using still images" in INTERSPEECH (*awarded Best Student Paper), 2020
109	International Conference	Soo-Whan Chung, Hong-Goo Kang, Joon Son Chung "Seeing Voices and Hearing Voices: Learning Discriminative Embeddings Using Cross-Modal Self-Supervision" in INTERSPEECH, 2020
108	International Conference	Hyewon Han, Soo-Whan Chung, Hong-Goo Kang "MIRNet: Learning multiple identities representations in overlapped speech" in INTERSPEECH, 2020
107	International Conference	Yoohwan Kwon, Soo-Whan Chung, Hong-Goo Kang "Intra-Class Variation Reduction of Speaker Representation in Disentanglement Framework" in INTERSPEECH, 2020
106	International Conference	Minh-Tri Ho, Jinyoung Lee, Bong-Ki Lee, Dong Hoon Yi, Hong-Goo Kang "A Cross-channel Attention-based Wave-U-Net for Multi-channel Speech Enhancement" in INTERSPEECH, 2020
105	International Conference	Seyun Um, Sangshin Oh, Kyungguen Byun, Inseon Jang, ChungHyun Ahn, Hong-Goo Kang "Emotional Speech Synthesis with Rich and Granularized Control" in ICASSP, 2020

Fixed-point implementation of MPEG-D unified speech and audio coding decoder

Previous

Sister Lab.

Yonsei University

Academic Website