Papers

Enhancing loudspeaker-based 3D audio with room modeling

International Conference
2006~2010
Posted by
한혜원
Date posted
2010-10-04 23:47
Authors : Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang

Year : 2010

Publisher / Conference : MMSP

For many years, spatial (3D) sound using headphones has been widely used in a number of applications. A rich spatial sensation is obtained by using head-related transfer functions (HRTFs) and playing the appropriate sound through headphones. In theory, loudspeaker audio systems would be capable of rendering 3D sound fields almost as rich as headphones, as long as the room impulse responses (RIRs) between the loudspeakers and the ears are known. In practice, however, obtaining these RIRs is hard, and the performance of loudspeaker-based systems is far from perfect. New hope has recently been raised by a system that tracks the user's head position and orientation and incorporates them into the RIR estimates in real time. That system made two simplifying assumptions: it used generic HRTFs, and it ignored room reverberation. In this paper we tackle the second problem: we incorporate a room reverberation estimate into the RIRs. Note that this is a nontrivial task: RIRs vary significantly with the listener's position, and even if one could measure them at a few points, they are notoriously hard to interpolate. Instead, we take an indirect approach: we model the room, and from that model we obtain an estimate of the main reflections. The position and characteristics of the walls do not vary with the user's movement, yet they allow an estimate of the RIR to be computed quickly for each new user position. Of course, the key question is whether the estimates are good enough. We show an improvement in localization perception of up to 32% (i.e., reducing average error from 23.5° to 15.9°).
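The abstract does not say which room model the authors use, so the following is only a minimal sketch of the general idea of deriving main reflections from fixed wall positions. It assumes a rectangular ("shoebox") room, the classic image-source method limited to first-order reflections, and a single frequency-independent reflection coefficient for all walls. The function names, the 0.7 coefficient, the sampling rate, and the room dimensions are all hypothetical and are not taken from the paper. HRTF filtering and head orientation are left out.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def first_order_images(src, room):
    """Mirror the source across each of the six walls of a shoebox room.

    src  -- (x, y, z) source (loudspeaker) position in metres
    room -- (Lx, Ly, Lz) room dimensions, walls at 0 and L on each axis
    """
    images = []
    for axis in range(3):
        for wall in (0.0, room[axis]):
            img = list(src)
            img[axis] = 2.0 * wall - src[axis]  # reflection about the wall plane
            images.append(tuple(img))
    return images


def early_rir(src, listener, room, reflection_coeff=0.7, fs=16000, length=1024):
    """Approximate the early RIR as the direct path plus first-order reflections.

    Each path contributes one tap, delayed by its travel time and scaled by
    1/distance (spherical spreading) times the assumed wall reflection gain.
    """
    rir = [0.0] * length
    paths = [(src, 1.0)]
    paths += [(img, reflection_coeff) for img in first_order_images(src, room)]
    for pos, gain in paths:
        dist = math.dist(pos, listener)
        delay = int(round(dist / SPEED_OF_SOUND * fs))
        if delay < length:
            rir[delay] += gain / max(dist, 1e-3)
    return rir


# Hypothetical geometry: the walls stay fixed, only the listener position changes.
room = (5.0, 4.0, 3.0)
loudspeaker = (1.0, 1.0, 1.0)
rir_a = early_rir(loudspeaker, (3.0, 2.0, 1.5), room)
rir_b = early_rir(loudspeaker, (2.0, 3.0, 1.5), room)  # listener has moved
```

Because the image positions depend only on the loudspeaker and the walls, they can be computed once. Only the distances to the listener need to be re-evaluated when the tracked head position changes, which is what makes this kind of estimate cheap to update per position.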
Total: 355
72 International Conference Eunwoo Song, Hong-Goo Kang, Joonil Lee "Fixed-point implementation of MPEG-D unified speech and audio coding decoder" in 19th International Conference on Digital Signal Processing (DSP), pp.110-113, 2014
71 International Conference Soonho Baek, Hong-Goo Kang "Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment" in ICASSP, 2014
70 International Conference Ho Seon Shin, Hong-Goo Kang "Bone-Conduction Speech Enhancement using a Speaker-Independent Filter" in ICEIC, 2014
69 International Conference Soonho Baek, Hong-Goo Kang "Vector Taylor Series based HMM Adaptation for Generalized Cepstrum in Noisy Environment" in ASRU, 2013
68 International Conference Jung-Won Lee, Hong-Goo Kang, Samuel Kim, Yoonjae Lee "Detecting pathological speech using local and global characteristics of harmonic-to-noise ratio" in APSIPA, 2013
67 International Conference Eunwoo Song, Jongyoub Ryu, Hong-Goo Kang "Speech enhancement for pathological voice using time-frequency trajectory excitation modeling" in APSIPA, 2013
66 International Conference Jinkyu Lee, Hyunson Seo, Hong-Goo Kang "Adaptation of HMM dynamic parameters in reverberant environment" in EUSIPCO, 2013
65 International Conference Jae-Mo Yang, Hong-Goo Kang "Adaptive multichannel linear prediction based dereverberation in time-varying room environments" in EUSIPCO, 2013
64 International Conference JeeSok Lee, Frank Soong, Hong-Goo Kang "Source-Filter based Full-band Adaptive Harmonic Model and Its Application to Speech Prosody Modification" in INTERSPEECH, 2013
63 International Conference Young-Sun Joo, Chi-Sang Jung, Hong-Goo Kang "Enhancement of spectral clarity for HMM-based text-to-speech systems" in ICASSP, 2013