Papers

A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification

International Conference
2006~2010
작성자
한혜원
작성일
2010-09-26 23:52
조회
1413
Authors : Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang

Year : 2010

Publisher / Conference : INTERPSEECH

Page : 2754-2757

In this paper, we propose a spectral kurtosis based approach to extract features with a variable frame length and rate for speaker verification. Since the speaker-specific information of features in each frame changes depending upon the characteristics of speech, it is important to determine the appropriate frame length and rate to extract the salient feature frames. In order to distinctively represent the characteristics of vowels and consonants both in time and frequency domains, we introduce a variable frame length and rate (VFLR) method based on spectral kurtosis, which provides a local measure of time-frequency concentration. Experimental results verify that the proposed VFLR method improves the performance of the speaker verification system on the NIST SRE-06 database by 9.725% (relative) compared to the feature extraction method with the fixed length and rate.
전체 355
46 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
45 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERPSEECH, pp.2754-2757, 2010
44 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010
43 International Conference Myung-Suk Song , Cha Zhang, Dinei Florencio, Hong-Goo Kang "Personal 3D audio system with loudspeakers" in ICME, 2010
42 International Conference Se-Woon Jeon, Young-Cheol Park, Seok-Pil Lee, Dae Hee Youn "Robust Representation of Spatial Sound in Stereo-to-Multichannel Upmix" in 128th Convention of Audio Engineering Society, pp.7976, 2010
41 International Conference Se-Woon Jeon, Dongil Hyun, Jeongil Seo, Young-Cheol Park, Dae Hee Youn "Enhancement of principal to ambient energy ratio for PCA-based parametric audio coding" in ICASSP, 2010
40 International Conference Ho Seon Shin, Min-Seok Choi, Taesu Kim, Hong-Goo Kang "Binaural loudness based speech reinforcement with a closed-form solution" in ICASSP, 2010
39 International Conference SoonHo Beak, Myung-Suk Song, Seok-Pil Lee, Hong-Goo Kang "Speaker Array System Based on Equalization Method with a Quiet Zone" in 127th Convention of Audio Engineering Society, pp.7951, 2009
38 International Conference Dongil Hyun, Jeongil Seo, Youngcheol Park, Daehee Youn "Robust Interchannel Correlation (ICC) Estimation Using Constant Interchannel Time Difference (ICTD) Compensation" in 127th Convention of Audio Engineering Society, pp.7934, 2009
37 International Conference Jae-Mo Yang, Chang-Heon Lee, Hong-Goo Kang "A robust time difference of arrival estimator in reverberant environments" in EUSIPCO, 2009