Papers

Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information

International Journal
2006~2010
Posted by
이진영
Posted on
2010-08-01 14:25
Authors : Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang

Year : 2010

Publisher / Conference : IEEE Transactions on Audio, Speech, and Language Processing

Volume : 18, issue 6

Page : 1332-1340

In this paper, an information theoretic approach to selecting feature frames for speaker recognition systems is proposed. A conventional approach in which the frame shift is fixed to around half of the frame length may not be the best choice, because the characteristics of the speech signal may rapidly change, especially at phonetic boundaries. Experimental results show that the recognition accuracy increases if the frame interval is directly controlled using phonetic information. By applying these results to the well-known fact that the recognition accuracy is directly correlated with the amount of mutual information, this paper suggests a novel feature frame selection method for speaker recognition. Specifically, feature frames are chosen to have minimum-redundancy within selected feature frames, but maximum-relevancy to speaker models. It is verified by experiments that the proposed method produces consistent improvement, especially in a speaker verification system. It is also robust against variations in acoustic environment.
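The selection criterion described in the abstract (low redundancy among the chosen frames, high relevance to the speaker model) can be illustrated with a small greedy sketch. This is not the authors' algorithm. The function names `select_frames` and `cosine_similarity`, the precomputed per-frame `relevance` scores, and the use of absolute cosine similarity as a stand-in for mutual information between frames are all assumptions made for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_frames(frames, relevance, k):
    """Greedily pick up to k frame indices with high relevance and low mutual redundancy.

    frames    : list of feature vectors, one per frame (hypothetical input)
    relevance : per-frame relevance to the speaker model (assumed precomputed)
    k         : number of frames to keep
    Redundancy is approximated by mean absolute cosine similarity to the frames
    already selected; the paper uses mutual information instead.
    """
    if len(frames) != len(relevance):
        raise ValueError("frames and relevance must have the same length")
    k = min(k, len(frames))
    selected = []
    remaining = set(range(len(frames)))
    while len(selected) < k:
        best_index, best_score = None, -math.inf
        for i in sorted(remaining):  # sorted for deterministic tie-breaking
            if selected:
                redundancy = sum(
                    abs(cosine_similarity(frames[i], frames[j])) for j in selected
                ) / len(selected)
            else:
                redundancy = 0.0
            score = relevance[i] - redundancy
            if score > best_score:
                best_index, best_score = i, score
        selected.append(best_index)
        remaining.remove(best_index)
    return sorted(selected)  # indices in time order
```

For example, with `frames = [[1, 0], [1, 0], [0, 1]]` and `relevance = [0.9, 0.8, 0.5]`, calling `select_frames(frames, relevance, 2)` returns `[0, 2]`: the second frame is more relevant than the third, but it duplicates the first and is skipped.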
Total 355
138 International Conference Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "Enhancing loudspeaker-based 3D audio with room modeling" in MMSP, 2010
137 International Journal Dong-il Hyun, Donggeum Lee, Youngcheol Park, Dae Hee Youn, Jeongil Seo "Joint Channel Coding Based on Principal Component Analysis" in ETRI Journal, vol.32, issue 5, pp.831-834, 2010
136 International Conference Chi-Sang Jung, Kyu J. Han, Hyunson Seo, Shrikanth S. Narayanan, Hong-Goo Kang "A Variable Frame Length and Rate Algorithm Based on the Spectral Kurtosis Measure for Speaker Verification" in INTERSPEECH, pp.2754-2757, 2010
135 International Conference Ming Li, Chi-Sang Jung, Kyu J. Han "Combining Five Acoustic Level Modeling Methods for Automatic Speaker Age and Gender Recognition" in INTERSPEECH, pp.2826-2829, 2010
134 Domestic Journal 송정욱, 오현오, 강홍구 "A New MPEG Reference Model for Unified Speech and Audio Coding" in 전자공학회논문지, vol.47 SP, no. 5, pp.74-80, 2010
133 Domestic Journal 전세운, 박영철, 윤대희 "A Study on Multichannel Format Conversion and Effective Preservation of Spatial 3D Audio Information" in 전자공학회논문지, vol.47 SP, no. 5, pp.34-44, 2010
132 Domestic Journal 오현오, 정양원 "SAOC, the Object Audio Coding Standard: Technology and Applications" in 전자공학회논문지, vol.47 SP, no. 5, pp.45-55, 2010
131 Domestic Conference 서현선, 정치상, 강홍구 "A Support Vector Machine Based Speaker Verification System Using Fusion of Phonetic-Characteristic-Based Scores" in 한국음향학회, 2010
130 Domestic Conference 신호선, 최가원, 강홍구 "Emotion Recognition Using a Speech Enhancement Algorithm with an SNR Recovery Technique in Noisy Environments" in 음성통신 및 신호처리학술대회, vol.27, no. 1, 2010
129 International Journal Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang "Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information" in IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue 6, pp.1332-1340, 2010