Papers

Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition

International Conference
2006~2010
Posted by
한혜원
Posted on
2010-09-26 23:51
Authors : Ming Li, Chi-Sang Jung, Kyu J. Han

Year : 2010

Publisher / Conference : INTERSPEECH

Page : 2826-2829

This paper presents a novel approach to automatic speaker age and gender identification that combines five different methods at the acoustic level to improve on the baseline performance. The five subsystems are (1) a Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) a support vector machine (SVM) based on GMM mean supervectors, (3) an SVM based on GMM maximum likelihood linear regression (MLLR) matrix supervectors, (4) an SVM based on GMM 'Tandem' supervectors, and (5) the SVM baseline system based on 450-dimensional utterance-level feature vectors, including prosodic features, provided by the challenge organizing committee. To improve the overall classification performance, the five subsystems are fused at the score level. The proposed fusion system achieves 52.7% unweighted accuracy on the joint age-gender classification task and outperforms the GMM-MFCC system and the SVM baseline by 9.6% and 8.2% absolute, respectively, on the 2010 Interspeech Paralinguistic Challenge aGender database.
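The abstract does not say how the subsystem scores are fused, so the following is only a minimal sketch of one common form of score-level fusion, together with the unweighted accuracy metric (the mean of per-class recalls). The z-normalization step, the equal default weights, the function names (`z_normalize`, `fuse_scores`, `unweighted_accuracy`), and the toy scores are all assumptions made for illustration and are not taken from the paper.

```python
from statistics import mean, pstdev


def z_normalize(scores):
    """Scale one subsystem's scores (utterances x classes) to zero mean, unit variance."""
    flat = [s for row in scores for s in row]
    mu, sigma = mean(flat), pstdev(flat)
    sigma = sigma or 1.0  # guard against constant scores
    return [[(s - mu) / sigma for s in row] for row in scores]


def fuse_scores(subsystem_scores, weights=None):
    """Weighted sum of normalized per-class scores; returns one class index per utterance.

    Assumption: a simple linear fusion. The weights would normally be tuned
    on development data; equal weights are used when none are given.
    """
    n_sys = len(subsystem_scores)
    weights = weights or [1.0 / n_sys] * n_sys
    normalized = [z_normalize(s) for s in subsystem_scores]
    n_utts = len(normalized[0])
    n_classes = len(normalized[0][0])
    predictions = []
    for u in range(n_utts):
        fused = [
            sum(w * sys_scores[u][c] for w, sys_scores in zip(weights, normalized))
            for c in range(n_classes)
        ]
        predictions.append(max(range(n_classes), key=fused.__getitem__))
    return predictions


def unweighted_accuracy(labels, predictions):
    """Mean of per-class recalls, so every class counts equally regardless of size."""
    recalls = []
    for c in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == c]
        recalls.append(sum(predictions[i] == c for i in idx) / len(idx))
    return mean(recalls)


# Toy example: two hypothetical subsystems, four utterances, two classes.
sys_a = [[0.9, 0.1], [0.7, 0.3], [0.2, 0.8], [0.55, 0.45]]
sys_b = [[0.7, 0.3], [0.4, 0.6], [0.1, 0.9], [0.2, 0.8]]
labels = [0, 0, 1, 1]

predictions = fuse_scores([sys_a, sys_b])
print(predictions, unweighted_accuracy(labels, predictions))
```

In this toy data each subsystem misclassifies one utterance on its own (the second for `sys_b`, the fourth for `sys_a`), while the fused scores classify all four correctly. Unweighted accuracy differs from plain accuracy when classes are imbalanced: predicting the majority class for every utterance yields a high plain accuracy but only 1/K unweighted accuracy for K classes.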
Total: 355
205 Domestic Journal Dong-il Hyun, Young-cheol Park, Dae Hee Youn "An Enhanced Amplitude Panning Method for Virtual Sound Source Imaging" in 전자공학회논문지, vol.50, no.3, pp.139-145, 2013
204 International Conference Taegyu Lee, Seokjin Lee, Young-cheol Park, Dae Hee Youn "Virtual bass system based on a multiband harmonic generation" in ICCE, 2013
203 International Journal Chi-Sang Jung, Young-Sun Joo, Hong-Goo Kang "Waveform Interpolation-Based Speech Analysis/Synthesis for HMM-Based TTS Systems" in IEEE Signal Processing Letters, vol.19, issue 12, pp.809-812, 2012
202 International Conference Se-Woon Jeon, Dae Hee Youn, Young-Cheol Park "Blind depth estimation based on primary-to-ambient energy ratio for 3-D acoustic depth rendering" in APSIPA ASC, 2012
201 International Journal Dong-il Hyun, Young-Cheol Park, Dae Hee Youn "Estimation and quantization of ICC-dependent phase parameters for parametric stereo audio coding" in EURASIP Journal on Audio, Speech, and Music Processing, vol.27, 2012
200 International Conference Sunwoong Choi, Dong-il Hyun, Young-cheol Park, Seokpil Lee, Dae Hee Youn "Blind Upmixing for Height and Wide Channels Based on an Image Source Method" in 133rd Convention of Audio Engineering Society, pp.8752, 2012
199 International Conference Ho Seon Shin, Hong-Goo Kang, Tim Fingscheidt "Survey of Speech Enhancement Supported by a Bone Conduction Microphone" in Speech Communication; 10. ITG Symposium, 2012
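198 Domestic Journal Dong-il Hyun, Seokpil Lee, Young-cheol Park, Dae Hee Youn "An Improved Synthesis Method for Negative Inter-Channel Correlation Parameters Based on an Anti-Phase Primary Component" in 한국음향학회지, vol.31, no.6, pp.410-418, 2012
197 Domestic Journal Sunwoong Choi, Dong-il Hyun, Seokpil Lee, Young-cheol Park, Dae Hee Youn "A Stereo-to-10.2-Channel Blind Upmix Method for Enhancing the 3-D Sound Effect" in 한국음향학회지, vol.31, no.5, pp.340-351, 2012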
196 International Journal Myung-Suk Song, Hong-Goo Kang "Single-channel dereverberation using a non-causal minimum variance distortionless response filter" in The Journal of the Acoustical Society of America, vol.132, issue 1, 2012