Papers

Enhancing loudspeaker-based 3D audio with room modeling

International Conference

2006~2010

Posted by

한혜원

Posted on

2010-10-04 23:47


Authors : Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang

Year : 2010

Publisher / Conference : MMSP

For many years, spatial (3D) sound using headphones has been widely used in a number of applications. A rich spatial sensation is obtained by using head-related transfer functions (HRTFs) and playing the appropriate sound through headphones. In theory, loudspeaker audio systems would be capable of rendering 3D sound fields almost as rich as those of headphones, as long as the room impulse responses (RIRs) between the loudspeakers and the ears are known. In practice, however, obtaining these RIRs is hard, and the performance of loudspeaker-based systems is far from perfect. New hope has recently been raised by a system that tracks the user's head position and orientation and incorporates them into the RIR estimates in real time. That system made two simplifying assumptions: it used generic HRTFs, and it ignored room reverberation. In this paper we tackle the second problem: we incorporate a room reverberation estimate into the RIRs. Note that this is a nontrivial task: RIRs vary significantly with the listener's position, and even if one could measure them at a few points, they are notoriously hard to interpolate. Instead, we take an indirect approach: we model the room, and from that model we obtain an estimate of the main reflections. The positions and characteristics of the walls do not vary with the user's movement, yet they allow an estimate of the RIR to be computed quickly for each new user position. Of course, the key question is whether the estimates are good enough. We show an improvement in localization perception of up to 32% (i.e., a reduction in average error from 23.5° to 15.9°).
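The idea of deriving the main reflections from a fixed room model can be illustrated with a minimal sketch. It assumes a rectangular (shoebox) room and a first-order image-source model. The abstract does not say which room model the paper uses, so the technique, the function names, the single reflection coefficient of 0.7, and the 343 m/s speed of sound are all assumptions made for illustration only. The second function reproduces the arithmetic behind the reported 32% figure.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the paper


def first_order_reflections(source, listener, room, reflection_coeff=0.7):
    """Return sorted (delay_seconds, gain) pairs for the direct path and the
    six first-order wall reflections.

    Hypothetical helper: the room is a box with one corner at the origin and
    the opposite corner at `room` = (length, width, height), all in metres.
    """
    # Direct path: the real source, with no wall attenuation.
    images = [(tuple(source), 1.0)]
    for axis in range(3):
        for wall in (0.0, room[axis]):
            image = list(source)
            # Mirror the source across the wall plane on this axis.
            image[axis] = 2.0 * wall - source[axis]
            images.append((tuple(image), reflection_coeff))

    paths = []
    for position, coeff in images:
        distance = math.dist(position, listener)
        # Delay from path length; gain from 1/r spreading and wall loss.
        paths.append((distance / SPEED_OF_SOUND, coeff / distance))
    return sorted(paths)


def relative_improvement(error_before, error_after):
    """Fractional reduction in error, e.g. in average localization error."""
    return (error_before - error_after) / error_before


if __name__ == "__main__":
    # Only the listener position changes between calls; the walls stay fixed.
    taps = first_order_reflections((1.0, 1.0, 1.0), (3.0, 1.0, 1.0), (4.0, 3.0, 2.5))
    for delay, gain in taps:
        print(f"delay = {delay * 1000:.2f} ms, gain = {gain:.3f}")
    print(f"improvement = {relative_improvement(23.5, 15.9):.1%}")
```

Because the wall geometry is fixed, only the distances need to be recomputed when the tracked listener position changes, which is what makes this kind of estimate cheap to update. A practical system would also need higher-order reflections, frequency-dependent wall absorption, and the HRTF for each arrival direction, none of which are modelled here.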


Total: 355

72	International Conference	Eunwoo Song, Hong-Goo Kang, Joonil Lee "Fixed-point implementation of MPEG-D unified speech and audio coding decoder" in 19th International Conference on Digital Signal Processing (DSP), pp.110-113, 2014
71	International Conference	Soonho Baek, Hong-Goo Kang "Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment" in ICASSP, 2014
70	International Conference	Ho Seon Shin, Hong-Goo Kang "Bone-Conduction Speech Enhancement using a Speaker-Independent Filter" in ICEIC, 2014
69	International Conference	Soonho Baek, Hong-Goo Kang "Vector Taylor Series based HMM Adaptation for Generalized Cepstrum in Noisy Environment" in ASRU, 2013
68	International Conference	Jung-Won Lee, Hong-Goo Kang, Samuel Kim, Yoonjae Lee "Detecting pathological speech using local and global characteristics of harmonic-to-noise ratio" in APSIPA, 2013
67	International Conference	Eunwoo Song, Jongyoub Ryu, Hong-Goo Kang "Speech enhancement for pathological voice using time-frequency trajectory excitation modeling" in APSIPA, 2013
66	International Conference	Jinkyu Lee, Hyunson Seo, Hong-Goo Kang "Adaptation of HMM dynamic parameters in reverberant environment" in EUSIPCO, 2013
65	International Conference	Jae-Mo Yang, Hong-Goo Kang "Adaptive multichannel linear prediction based dereverberation in time-varying room environments" in EUSIPCO, 2013
64	International Conference	JeeSok Lee, Frank Soong, Hong-Goo Kang "Source-Filter based Full-band Adaptive Harmonic Model and Its Application to Speech Prosody Modification" in INTERSPEECH, 2013
63	International Conference	Young-Sun Joo, Chi-Sang Jung, Hong-Goo Kang "Enhancement of spectral clarity for HMM-based text-to-speech systems" in ICASSP, 2013

