Papers

Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition

International Conference
2006~2010
작성자
한혜원
작성일
2010-09-26 23:51
조회
1829
Authors : Ming Li, Chi-Sang Jung, Kyu J. Han

Year : 2010

Publisher / Conference : INTERSPEECH

Page : 2826-2829

This paper presents a novel automatic speaker age and gender identification approach which combines five different methods at the acoustic level to improve the baseline performance. The five subsystems are (1) Gaussian mixture model (GMM) system based on mel-frequency cepstral coefficient (MFCC) features, (2) Support vector machine (SVM) based on GMM mean supervectors, (3) SVM based on GMM maximum likelihood linear regression (MLLR) matrix supervectors, (4) SVM based on GMM 'Tandem' supervectors, and (5) SVM baseline system based on the 450-dimensional feature vectors including prosodic features at the utterance level provided by the challenge organizing committee. To improve the overall classification performance, fusion of these five subsystems at the score level is performed. The proposed fusion system achieves 52.7% unweighted accuracy for the joint age-gender classification task and outperforms the GMM-MFCC system and SVM baseline, respectively, by 9.6% and 8.2% absolute improvement on the 2010 Interspeech Paralinguistic Challenge aGender database.
전체 364
68 International Conference Jung-Won Lee, Hong-Goo Kang, Samuel Kim, Yoonjae Lee "Detecting pathological speech using local and global characteristics of harmonic-to-noise ratio" in APSIPA, 2013
67 International Conference Eunwoo Song, Jongyoub Ryu, Hong-Goo Kang "Speech enhancement for pathological voice using time-frequency trajectory excitation modeling" in APSIPA, 2013
66 International Conference Jinkyu Lee, Hyunson Seo, Hong-Goo Kang "Adaptation of HMM dynamic parameters in reverberant environment" in EUSIPCO, 2013
65 International Conference Jae-Mo Yang, Hong-Goo Kang "Adaptive multichannel linear prediction based dereverberation in time-varying room environments" in EUSIPCO, 2013
64 International Conference JeeSok Lee, Frank Soong, Hong-Goo Kang "Source-Filter based Full-band Adaptive Harmonic Model and Its Application to Speech Prosody Modification" in INTERSPEECH, 2013
63 International Conference Young-Sun Joo, Chi-Sang Jung, Hong-Goo Kang "Enhancement of spectral clarity for HMM-based text-to-speech systems" in ICASSP, 2013
62 International Conference Taegyu Lee, Seokjin Lee, Young-cheol Park, Dae Hee Youn "Virtual bass system based on a multiband harmonic generation" in ICCE, 2013
61 International Conference Se-Woon Jeon, Dae Hee Youn, Young-Cheol Park "Blind depth estimation based on primary-to-ambient energy ratio for 3-D acoustic depth rendering" in APSIPA ASC, 2012
60 International Conference Sunwoong Choi, Dong-il Hyun, Young-cheol Park, Seokpil Lee, Dae Hee Youn "Blind Upmixing for Height and Wide Channels Based on an Image Source Method" in 133th Convention of Audio Engineering Society, pp.8752, 2012
59 International Conference Ho Seon Shin, Hong-Goo Kang, Tim Fingscheidt "Survey of Speech Enhancement Supported by a Bone Conduction Microphone" in Speech Communication; 10. ITG Symposium, 2012