Papers

Phonetically optimized speaker modeling for robust speaker recognition

International Journal
2006~2010
Posted by : 이진영
Date posted : 2009-09-01 14:23
Authors : Bong-Jin Lee, Jeung-Yoon Choi, Hong-Goo Kang

Year : 2009

Publisher / Conference : The Journal of the Acoustical Society of America

Volume : 126, issue 3

This paper proposes an efficient method to improve speaker recognition performance by dynamically controlling the ratio of phoneme class information. It utilizes the fact that each phoneme contains different amounts of speaker discriminative information that can be measured by mutual information. After classifying phonemes into five classes, the optimal ratio of each class in both training and testing processes is adjusted using a non-linear optimization technique, i.e., the Nelder–Mead method. Speaker identification results verify that the proposed method achieves an 18% improvement in terms of error rate compared to a baseline system.
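The ratio-tuning step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The objective `toy_error`, the `TARGET` mix, and the softmax parameterization in `ratios_from_params` are assumptions made purely for illustration. In the paper the objective would be a speaker identification error rate measured on data. The simplex routine is a simplified Nelder–Mead variant written in plain Python.

```python
import math

# Hypothetical "ideal" mix of the five phoneme classes (illustrative values only).
TARGET = [0.35, 0.25, 0.20, 0.15, 0.05]

def ratios_from_params(params):
    """Map 4 free parameters to 5 positive ratios summing to 1 (softmax, last logit fixed at 0)."""
    logits = list(params) + [0.0]
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def toy_error(params):
    """Stand-in for a measured error rate; an assumption, not the paper's objective."""
    ratios = ratios_from_params(params)
    return 0.10 + sum((r - t) ** 2 for r, t in zip(ratios, TARGET))

def nelder_mead(f, x0, step=0.5, max_iter=2000, tol=1e-12):
    """Minimize f from x0 using reflection, expansion, contraction, and shrink moves."""
    n = len(x0)
    # Initial simplex: x0 plus one point offset along each axis.
    simplex = [list(x0)]
    for i in range(n):
        p = list(x0)
        p[i] += step
        simplex.append(p)
    for _ in range(max_iter):
        simplex.sort(key=f)
        best, worst = simplex[0], simplex[-1]
        if abs(f(worst) - f(best)) < tol:
            break
        # Centroid of every vertex except the worst one.
        c = [sum(p[i] for p in simplex[:-1]) / n for i in range(n)]
        refl = [c[i] + (c[i] - worst[i]) for i in range(n)]
        if f(refl) < f(best):
            # Reflection beat the best vertex: try expanding further.
            exp = [c[i] + 2.0 * (c[i] - worst[i]) for i in range(n)]
            simplex[-1] = exp if f(exp) < f(refl) else refl
        elif f(refl) < f(simplex[-2]):
            simplex[-1] = refl
        else:
            # Contract the worst vertex toward the centroid.
            con = [c[i] + 0.5 * (worst[i] - c[i]) for i in range(n)]
            if f(con) < f(worst):
                simplex[-1] = con
            else:
                # Shrink every vertex toward the best one.
                simplex = [best] + [
                    [best[i] + 0.5 * (p[i] - best[i]) for i in range(n)]
                    for p in simplex[1:]
                ]
    simplex.sort(key=f)
    return simplex[0]

start = [0.0, 0.0, 0.0, 0.0]  # corresponds to uniform class ratios
best_params = nelder_mead(toy_error, start)
best_ratios = ratios_from_params(best_params)
```

Nelder–Mead needs only function values, not gradients, which suits an objective such as an error rate that is obtained by running a recognizer. With SciPy available, `scipy.optimize.minimize(toy_error, start, method="Nelder-Mead")` could replace the hand-written routine.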
Total 355
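175 Domestic Journal 전세운, 박영철, 이석필, 윤대희 "Level Panning Algorithm for Generating Virtual Sound Sources in a Multichannel Loudspeaker Environment" in The Journal of the Acoustical Society of Korea (한국음향학회지), vol.30, issue 4, pp.197-206, 2011
174 Domestic Journal Yoomi Hur, Young-Cheol Park, Seok-Pil Lee, Dae Hee Youn "Efficient Individualization Method of HRTFs Using Critical-band Based Spectral Cue Control" in The Journal of the Acoustical Society of Korea (한국음향학회지), vol.30, issue 4, pp.167-180, 2011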
173 International Journal Chi-Sang Jung, Hyunson Seo, Hong-Goo Kang "Estimating Redundancy Information of Selected Features in Multi-dimensional Pattern Classification" in Pattern Recognition Letters, vol.32, issue 4, pp.590-596, 2011
172 Domestic Journal 최민석, 신호선, 황영수, 강홍구 "Time-Frequency Domain Impulsive Noise Detection System for Speech Signals" in The Journal of the Acoustical Society of Korea (한국음향학회지), vol.30, issue 2, pp.73-79, 2011
171 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
170 International Journal Dong-il Hyun, Donggeum Lee, Youngcheol Park, Dae Hee Youn, Jeongil Seo "Joint Channel Coding Based on Principal Component Analysis" in ETRI Journal, vol.32, issue 5, pp.831-834, 2010
169 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERSPEECH, pp.2754-2757, 2010
168 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010
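167 Domestic Journal 송정욱, 오현오, 강홍구 "A New MPEG Reference Model for Unified Speech and Audio Coding" in Journal of the Institute of Electronics Engineers of Korea (전자공학회논문지), vol.47 SP, issue 5, pp.74-80, 2010
166 Domestic Journal 전세운, 박영철, 윤대희 "A Study on Multichannel Format Conversion and Effective Preservation of Spatial Audio Information" in Journal of the Institute of Electronics Engineers of Korea (전자공학회논문지), vol.47 SP, issue 5, pp.34-44, 2010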