Papers

A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification

International Conference
2006~2010
작성자
한혜원
작성일
2010-09-26 23:52
조회
1879
Authors : Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang

Year : 2010

Publisher / Conference : INTERPSEECH

Page : 2754-2757

In this paper, we propose a spectral kurtosis based approach to extract features with a variable frame length and rate for speaker verification. Since the speaker-specific information of features in each frame changes depending upon the characteristics of speech, it is important to determine the appropriate frame length and rate to extract the salient feature frames. In order to distinctively represent the characteristics of vowels and consonants both in time and frequency domains, we introduce a variable frame length and rate (VFLR) method based on spectral kurtosis, which provides a local measure of time-frequency concentration. Experimental results verify that the proposed VFLR method improves the performance of the speaker verification system on the NIST SRE-06 database by 9.725% (relative) compared to the feature extraction method with the fixed length and rate.
전체 364
194 International Journal Jung-Won Lee, Jeung-Yoon Choi, Hong-Goo Kang "Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone" in The Journal of the Acoustical Society of America, vol.131, issue 2, 2012
193 International Conference Se-Woon Jeon, Young-cheol Park, Dae Hee Youn "Acoustic depth rendering for 3D multimedia applications" in ICCE, 2012
192 International Journal Jae-Mo Yang, Hong-Goo Kang "Two-stage source tracking method using a multiple linear regression model in the expanded phase domain" in EURASIP Journal on Advances in Signal Processing, vol.5, 2012
191 International Conference Jinkyu Lee, Soonho Baek, Hong-Goo Kang "Signal and feature domain enhancement approaches for robust speech recognition" in 8th International Conference on Information, communications and Signal Processing, 2011
190 International Journal Min-Seok Choi, Hong-Goo Kang "Transient noise reduction in speech signal with a modified long-term predictor" in EURASIP Journal on Advances in Signal Processing, vol.141, 2011
189 International Journal Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "An Interactive 3-D Audio System With Loudspeakers" in IEEE Transactions on Multimedia, vol.13, issue 5, pp.844-855, 2011
188 International Conference Dong-il Hyun, Young-cheol Park, Seok-pil Lee, Dae Hee Youn "Enhanced Interchannel Correlation (ICC) Synthesis for Spatial Audio Coding" in AES 43th International Conference, 2011
187 International Journal Tacksung Choi, Young-Cheol Park, Dae Hee Youn, Seokpil Lee "Virtual Sound Rendering in a Stereophonic Loudspeaker Setup" in IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue 7, pp.1962-1974, 2011
186 International Conference Se-Woon Jeon, Young-cheol Park, Seok-Pil Lee, Dae Hee Youn "Virtual Source Panning using Multiple-Wise Vector Base in the Multispeaker Stereo Format" in EUSIPCO, pp.1337-1341, 2011
185 Domestic Conference 신호선,양재모,강홍구 "GSC 빔포머의 실시간 구현을 위한 고정 소수점 연산화" in 한국음향학회, 2011