Papers

Enhancing loudspeaker-based 3D audio with room modeling

International Conference
2006~2010
작성자
한혜원
작성일
2010-10-04 23:47
조회
1292
Authors : Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang

Year : 2010

Publisher / Conference : MMSP

For many years, spatial (3D) sound using headphones has been widely used in a number of applications. A rich spatial sensation is obtained by using head related transfer functions (HRTF) and playing the appropriate sound through headphones. In theory, loudspeaker audio systems would be capable of rendering 3D sound fields almost as rich as headphones, as long as the room impulse responses (RIRs) between the loudspeakers and the ears are known. In practice, however, obtaining these RIRs is hard, and the performance of loudspeaker based systems is far from perfect. New hope has been recently raised by a system that tracks the user's head position and orientation, and incorporates them into the RIRs estimates in real time. That system made two simplifying assumptions: it used generic HRTFs, and it ignored room reverberation. In this paper we tackle the second problem: we incorporate a room reverberation estimate into the RIRs. Note that this is a nontrivial task: RIRs vary significantly with the listener's positions, and even if one could measure them at a few points, they are notoriously hard to interpolate. Instead, we take an indirect approach: we model the room, and from that model we obtain an estimate of the main reflections. Position and characteristics of walls do not vary with the users' movement, yet they allow to quickly compute an estimate of the RIR for each new user position. Of course the key question is whether the estimates are good enough. We show an improvement in localization perception of up to 32% (i.e., reducing average error from 23.5° to 15.9°).
전체 355
138 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
137 International Journal Dong-il Hyun, Donggeum Lee, Youngcheol Park, Dae Hee Youn, Jeongil Seo "Joint Channel Coding Based on Principal Component Analysis" in ETRI Journal, vol.32, issue 5, pp.831-834, 2010
136 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERPSEECH, pp.2754-2757, 2010
135 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010
134 Domestic Journal 송정욱, 오현오, 강홍구 "통합 음성/오디오 부호화를 위한 새로운 MPEG 참조 모델" in 전자공학회논문지, vol.47 SP, 제 5호, pp.74-80, 2010
133 Domestic Journal 전세운, 박영철, 윤대희 "다채널 포맷 변환과 공간적인 입체 음향 정보의 효과적인 유지에 대한 연구" in 전자공학회논문지, vol.47 SP, 제 5호, pp.34-44, 2010
132 Domestic Journal 오현오, 정양원 "객체 오디오 부호화 표준 SAOC 기술 및 응용" in 전자공학회논문지, vol.47 SP, 제 5호, pp.45-55, 2010
131 Domestic Conference 서현선, 정치상, 강홍구 "음소 특성 기반 스코어의 퓨전 방식을 이용한 서포트 벡터 머신 기반 화자 검증 시스템" in 한국음향학회, 2010
130 Domestic Conference 신호선, 최가원, 강홍구 "잡음 환경에서의 SNR 회복 기법을 적용한 음성 향상 알고리즘을 이용한 감정인식" in 음성통신 및 신호처리학술대회, vol.27, no. 1, 2010
129 International Journal Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang "Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information" in IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue 6, pp.1332-1340, 2010