Papers

Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition

International Conference
2006~2010
Posted by
한혜원
Posted on
2010-09-26 23:51
Authors : Ming Li, Chi-Sang Jung, Kyu J. Han

Year : 2010

Publisher / Conference : INTERSPEECH

Page : 2826-2829

This paper presents a novel automatic speaker age and gender identification approach that combines five different methods at the acoustic level to improve on the baseline performance. The five subsystems are (1) a Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) a support vector machine (SVM) based on GMM mean supervectors, (3) an SVM based on GMM maximum likelihood linear regression (MLLR) matrix supervectors, (4) an SVM based on GMM 'Tandem' supervectors, and (5) the SVM baseline system based on the 450-dimensional utterance-level feature vectors, including prosodic features, provided by the challenge organizing committee. To improve the overall classification performance, these five subsystems are fused at the score level. The proposed fusion system achieves 52.7% unweighted accuracy on the joint age-gender classification task, outperforming the GMM-MFCC system and the SVM baseline by 9.6% and 8.2% absolute, respectively, on the 2010 Interspeech Paralinguistic Challenge aGender database.
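As a rough illustration of two terms used in the abstract, the sketch below computes unweighted accuracy (taken here to mean the average of per-class recalls) and fuses subsystem outputs at the score level. The abstract does not say how the scores are combined, so the weighted sum, the function names, and the class labels are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls, so each class counts equally
    regardless of how many test utterances it has."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


def fuse_scores(subsystem_scores, weights):
    """Weighted sum of per-class scores (an assumed fusion rule).

    subsystem_scores: one dict per subsystem, mapping class label -> score.
    weights: one float per subsystem.
    """
    fused = defaultdict(float)
    for scores, w in zip(subsystem_scores, weights):
        for label, s in scores.items():
            fused[label] += w * s
    return dict(fused)


def predict(subsystem_scores, weights):
    """Pick the class with the highest fused score."""
    fused = fuse_scores(subsystem_scores, weights)
    return max(fused, key=fused.get)


# Toy example with hypothetical labels and scores.
ua = unweighted_accuracy(["a", "a", "a", "b"], ["a", "a", "b", "b"])
label = predict(
    [{"x": 0.6, "y": 0.4}, {"x": 0.2, "y": 0.8}],
    [0.5, 0.5],
)
```

In the toy example, plain accuracy would be 3/4, while unweighted accuracy averages the recall of class "a" (2/3) and class "b" (1/1). In practice the fusion weights would be tuned on held-out development data.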
Total 371
181 Domestic Conference 주영선,정치상,강홍구 "운율 경계 정보를 이용한 HMM 기반의 한국어 음성합성 시스템" in 한국방송공학회, 2011
180 International Journal Jae-seong Lee, Young-Cheol Park, Dae Hee Youn, Kyung-ok Kang "Efficient Windowing Scheme for MDCT-Based TCX in AMR-WB+" in IEICE Transactions on Information and Systems, vol.E94-D, No.6, pp.1341-1344, 2011
179 International Journal Yoomi Hur, Jonathan S. Abel, Young-Cheol Park, Dae Hee Youn "Techniques for Synthetic Reconfiguration of Microphone Arrays" in Journal of the AES, vol.59, issue 6, pp.404-418, 2011
178 International Conference Dong-il Hyun, Jeongil Seo, Young-cheol Park, Dae Hee Youn "Improved phase parameter analysis and synthesis for parametric stereo audio coding" in ICASSP, 2011
177 International Conference Jeongook Song, Hyen-o Oh, Hong-Goo Kang "Enhanced long-term predictor for Unified Speech and Audio Coding" in ICASSP, 2011
176 Domestic Conference 노훈동,김성우,이충용,윤대희 "근거리장 방위각 추정을 위한 원거리장 근사화 기법 성능 평가 및 분석" in 한국음향학회, 2011
175 Domestic Journal 전세운, 박영철, 이석필, 윤대희 "다채널 스피커 환경에서 가상 음원을 생성하기 위한 레벨 패닝 알고리즘" in 한국음향학회지, vol.30, No.4, pp.197-206, 2011
174 Domestic Journal Yoomi Hur, Young-Cheol Park, Seok-Pil Lee, Dae Hee Youn "Efficient Individualization Method of HRTFs Using Critical-band Based Spectral Cue Control" in 한국음향학회지, vol.30, No.4, pp.167-180, 2011
173 International Journal Chi-Sang Jung, Hyunson Seo, Hong-Goo Kang "Estimating Redundancy Information of Selected Features in Multi-dimensional Pattern Classification" in Pattern Recognition Letters, vol.32, issue 4, pp.590-596, 2011
172 Domestic Journal 최민석, 신호선, 황영수, 강홍구 "음성 신호에서의 시간-주파수 축 충격 잡음 검출 시스템" in 한국음향학회지, vol.30, No.2, pp.73-79, 2011