Papers

Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition

International Conference
2006~2010
Posted by : 한혜원
Posted on : 2010-09-26 23:51
Authors : Ming Li, Chi-Sang Jung, Kyu J. Han

Year : 2010

Publisher / Conference : INTERSPEECH

Page : 2826-2829

This paper presents a novel approach to automatic speaker age and gender identification that combines five different methods at the acoustic level to improve on the baseline performance. The five subsystems are (1) a Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) a support vector machine (SVM) based on GMM mean supervectors, (3) an SVM based on GMM maximum likelihood linear regression (MLLR) matrix supervectors, (4) an SVM based on GMM 'Tandem' supervectors, and (5) the SVM baseline system based on the 450-dimensional utterance-level feature vectors, including prosodic features, provided by the challenge organizing committee. To improve the overall classification performance, the five subsystems are fused at the score level. The proposed fusion system achieves 52.7% unweighted accuracy on the joint age-gender classification task of the 2010 Interspeech Paralinguistic Challenge aGender database, an absolute improvement of 9.6% over the GMM-MFCC system and 8.2% over the SVM baseline.
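The two ideas the abstract relies on, score-level fusion and unweighted accuracy, can be illustrated with a minimal sketch. The abstract does not say how the scores are combined, so the weighted sum below is an assumption, and the function names (`fuse_scores`, `predict`, `unweighted_accuracy`), the weights, and the toy scores are all hypothetical. Unweighted accuracy is taken here to mean the average of the per-class recalls, so that every class counts equally regardless of how many utterances it has.

```python
from statistics import mean


def fuse_scores(subsystem_scores, weights):
    """Weighted sum of per-class scores from several subsystems.

    subsystem_scores[s][n][k] is the score that subsystem s assigns to
    class k for utterance n. A weighted sum is an assumed fusion rule,
    not necessarily the one used in the paper.
    """
    n_utts = len(subsystem_scores[0])
    n_classes = len(subsystem_scores[0][0])
    return [
        [
            sum(w * scores[n][k] for w, scores in zip(weights, subsystem_scores))
            for k in range(n_classes)
        ]
        for n in range(n_utts)
    ]


def predict(scores):
    """Pick the index of the highest-scoring class for each utterance."""
    return [max(range(len(row)), key=row.__getitem__) for row in scores]


def unweighted_accuracy(y_true, y_pred):
    """Average of per-class recalls over the classes present in y_true."""
    recalls = []
    for c in sorted(set(y_true)):
        idx = [i for i, t in enumerate(y_true) if t == c]
        recalls.append(sum(y_pred[i] == c for i in idx) / len(idx))
    return mean(recalls)


# Toy data (made up): two subsystems, four utterances, three classes.
labels = [0, 1, 1, 2]
sys_a = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3], [0.5, 0.4, 0.1], [0.1, 0.2, 0.7]]
sys_b = [[0.5, 0.4, 0.1], [0.3, 0.4, 0.3], [0.2, 0.7, 0.1], [0.3, 0.3, 0.4]]

fused = fuse_scores([sys_a, sys_b], weights=[0.5, 0.5])
print("subsystem A alone:", unweighted_accuracy(labels, predict(sys_a)))
print("fused A + B      :", unweighted_accuracy(labels, predict(fused)))
```

In this toy case subsystem A misclassifies the third utterance, and subsystem B's strong score for the correct class corrects it after fusion. In practice the fusion weights would be tuned on held-out development data rather than fixed by hand.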
Total: 327
177 International Conference Jeongook Song, Hyen-o Oh, Hong-Goo Kang "Enhanced long-term predictor for Unified Speech and Audio Coding" in ICASSP, 2011
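176 Domestic Conference 노훈동, 김성우, 이충용, 윤대희 "Performance Evaluation and Analysis of a Far-Field Approximation Technique for Near-Field Azimuth Estimation" in Acoustical Society of Korea, 2011
175 Domestic Journal 전세운, 박영철, 이석필, 윤대희 "A Level Panning Algorithm for Generating Virtual Sound Sources in a Multichannel Loudspeaker Environment" in Journal of the Acoustical Society of Korea, vol.30, no.4, pp.197-206, 2011
174 Domestic Journal Yoomi Hur, Young-Cheol Park, Seok-Pil Lee, Dae Hee Youn "Efficient Individualization Method of HRTFs Using Critical-band Based Spectral Cue Control" in Journal of the Acoustical Society of Korea, vol.30, no.4, pp.167-180, 2011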
173 International Journal Chi-Sang Jung, Hyunson Seo, Hong-Goo Kang "Estimating Redundancy Information of Selected Features in Multi-dimensional Pattern Classification" in Pattern Recognition Letters, vol.32, issue 4, pp.590-596, 2011
172 Domestic Journal 최민석, 신호선, 황영수, 강홍구 "A Time-Frequency Domain Impulsive Noise Detection System for Speech Signals" in Journal of the Acoustical Society of Korea, vol.30, no.2, pp.73-79, 2011
171 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
170 International Journal Dong-il Hyun, Donggeum Lee, Youngcheol Park, Dae Hee Youn, Jeongil Seo "Joint Channel Coding Based on Principal Component Analysis" in ETRI Journal, vol.32, issue 5, pp.831-834, 2010
169 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERSPEECH, pp.2754-2757, 2010
168 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010