Papers

Scalable Multiband Binaural Renderer for MPEG-H 3D Audio

International Journal
2011~2015
Posted by
이진영
Date posted
2015-08-01 22:05
Authors : Taegyu Lee, Hyun Oh Oh, Jeongil Seo, Young-Cheol Park, Dae Hee Youn

Year : 2015

Publisher / Conference : IEEE Journal of Selected Topics in Signal Processing

Volume : 9, issue 5

Page : 907-920

To provide immersive 3D multimedia services, MPEG has launched MPEG-H, ISO/IEC 23008, “High Efficiency Coding and Media Delivery in Heterogeneous Environments.” As its audio part, MPEG-H 3D Audio has been standardized based on multichannel loudspeaker configurations (e.g., 22.2). Binaural rendering is a key application of 3D audio; however, previous studies have focused on low-complexity binaural rendering, such as IIR filter design for HRTFs, or on pre-/post-processing to resolve in-head localization or front-back confusion. In this paper, a new binaural rendering algorithm is proposed that supports a large number of input channel signals and provides high quality in terms of timbre; parts of this algorithm were adopted into MPEG-H 3D Audio. The proposed algorithm truncates the binaural room impulse response at the mixing time, the transition point from the early reflections to the late reverberation. The two parts are processed independently, by variable order filtering in frequency domain (VOFF) and parametric late reverberation filtering (PLF), respectively. Further, a QMF domain tapped delay line (QTDL) is proposed to reduce complexity in the high-frequency band, based on human auditory perception and codec characteristics. A scalability scheme is adopted to cover a wide range of applications by adjusting the threshold of the mixing time. Experimental results show that the proposed algorithm provides the audio quality of a signal binaurally rendered with full-length binaural room impulse responses. A scalability test also shows that the proposed scheme offers a smooth trade-off between audio quality and computational complexity.
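The truncation step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: VOFF, PLF, and QTDL are not reproduced, and the function names, the toy impulse response, and the mixing-time index are all hypothetical. The sketch only shows a binaural room impulse response being split at an assumed mixing-time sample index, with each part filtered separately. When both parts are convolved exactly, their sum equals the full-length result, which is the reference the abstract compares against.

```python
def convolve(x, h):
    """Direct-form linear convolution of two sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y


def split_at_mixing_time(brir, mixing_index):
    """Split an impulse response into early and late parts.

    The late part is zero-padded at the front so that it keeps its
    original time alignment. `mixing_index` is an assumed sample index;
    the paper's way of choosing the mixing time is not shown here.
    """
    early = brir[:mixing_index]
    late = [0.0] * mixing_index + brir[mixing_index:]
    return early, late


# Toy data (hypothetical, not taken from the paper).
brir = [1.0, 0.5, 0.25, 0.1, 0.05, 0.02]
signal = [1.0, -1.0, 0.5]
mixing_index = 3

early, late = split_at_mixing_time(brir, mixing_index)

# Each part is filtered independently (plain convolution in this sketch).
y_early = convolve(signal, early)
y_late = convolve(signal, late)

# Reference: filtering with the full-length impulse response.
y_full = convolve(signal, brir)

# Pad the shorter early output and add the two contributions.
padded_early = y_early + [0.0] * (len(y_full) - len(y_early))
y_sum = [a + b for a, b in zip(padded_early, y_late)]
```

Lowering `mixing_index` shortens the early part that has to be filtered exactly. The abstract describes scalability as adjusting the threshold of the mixing time, and this index is only a loose analogy to that control.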
Total: 367
237 International Journal Taegyu Lee, Hyun Oh Oh, Jeongil Seo, Young-Cheol Park, Dae Hee Youn "Scalable Multiband Binaural Renderer for MPEG-H 3D Audio" in IEEE Journal of Selected Topics in Signal Processing, vol.9, issue 5, pp.907-920, 2015
236 International Conference Heejin Ahn, Eunwoo Song, Won-Suk Jun, Hong-goo Kang "A Compression Algorithms for Hidden Markov Model-Based Speech Synthesis Systems" in ITC-CSCC, pp.942-945, 2015
235 International Conference JeeSok Lee, Sejin Oh, Hong-Goo Kang "Coherent channel based subband multichannel dereverberation" in ICASSP, pp.2704-2708, 2015
234 International Conference Eunwoo Song, Young-Sun Joo, Hong-Goo Kang "Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system" in ICASSP, 2015
233 Domestic Journal 박영철, 이태규, 윤대희 "Standardization of MPEG-H 3D Audio Binaural Rendering Technology" (in Korean) in 대한전기학회, 전기의 세계, vol.64, no.2, pp.27-31, 2015
232 International Journal Taegyu Lee, Yonghyun Baek, Young-Cheol Park, Dae Hee Youn "Stereo upmix-based binaural auralization for mobile devices" in IEEE Transactions on Consumer Electronics, vol.60, issue 3, pp.411-419, 2014
231 International Conference Eunwoo Song, Hong-Goo Kang, Joonil Lee "Fixed-point implementation of MPEG-D unified speech and audio coding decoder" in 19th International Conference on Digital Signal Processing (DSP), pp.110-113, 2014
230 International Journal Soonho Baek, Hong-Goo Kang "Selection of spectral compressive operator for vector Taylor series-based model adaptation in noisy environments" in The Journal of the Acoustical Society of America, vol.135, 2014
229 International Conference Soonho Baek, Hong-Goo Kang "Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment" in ICASSP, 2014
228 International Journal Jae-Mo Yang, Hong-Goo Kang "Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction" in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue 3, pp.608-619, 2014