Papers

Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information

International Journal
2006~2010
Posted by : 이진영
Posted on : 2010-08-01 14:25
Views : 1390
Authors : Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang

Year : 2010

Publisher / Conference : IEEE Transactions on Audio, Speech, and Language Processing

Volume : 18, issue 6

Pages : 1332-1340

In this paper, an information theoretic approach to selecting feature frames for speaker recognition systems is proposed. A conventional approach in which the frame shift is fixed to around half of the frame length may not be the best choice, because the characteristics of the speech signal may rapidly change, especially at phonetic boundaries. Experimental results show that the recognition accuracy increases if the frame interval is directly controlled using phonetic information. By applying these results to the well-known fact that the recognition accuracy is directly correlated with the amount of mutual information, this paper suggests a novel feature frame selection method for speaker recognition. Specifically, feature frames are chosen to have minimum-redundancy within selected feature frames, but maximum-relevancy to speaker models. It is verified by experiments that the proposed method produces consistent improvement, especially in a speaker verification system. It is also robust against variations in acoustic environment.
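The selection criterion in the abstract (minimum redundancy among the selected frames, maximum relevance to the speaker model) can be illustrated with a small greedy sketch. This is not the authors' algorithm: the abstract does not say how the mutual-information terms are estimated, so `relevance` and `redundancy` are assumed to be precomputed scores. The function name `select_frames` and the weight `beta` are hypothetical names introduced only for this illustration.

```python
from typing import List, Sequence


def select_frames(relevance: Sequence[float],
                  redundancy: Sequence[Sequence[float]],
                  k: int,
                  beta: float = 1.0) -> List[int]:
    """Greedily pick k frame indices with high relevance and low redundancy.

    relevance[i]: assumed score of frame i against the speaker model.
    redundancy[i][j]: assumed pairwise redundancy between frames i and j.
    beta: hypothetical weight trading redundancy against relevance.
    """
    n = len(relevance)
    if not 0 <= k <= n:
        raise ValueError("k must be between 0 and the number of frames")

    selected: List[int] = []
    remaining = set(range(n))

    while len(selected) < k:
        def score(i: int) -> float:
            # The first pick is driven by relevance alone.
            if not selected:
                return relevance[i]
            # Later picks are penalised by their mean redundancy
            # with the frames that are already selected.
            mean_red = sum(redundancy[i][j] for j in selected) / len(selected)
            return relevance[i] - beta * mean_red

        # Iterating in index order makes ties resolve to the earlier frame.
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)

    # Return indices in time order so they can be used to slice a frame sequence.
    return sorted(selected)


if __name__ == "__main__":
    # Toy data: frames 0 and 1 are both relevant but nearly identical.
    relevance = [0.9, 0.85, 0.3, 0.8]
    redundancy = [
        [1.0, 0.9, 0.1, 0.1],
        [0.9, 1.0, 0.1, 0.1],
        [0.1, 0.1, 1.0, 0.1],
        [0.1, 0.1, 0.1, 1.0],
    ]
    print(select_frames(relevance, redundancy, k=2))
```

In the toy data, frames 0 and 1 are both highly relevant but almost duplicate each other, so the second pick skips frame 1 in favour of the less redundant frame 3. Setting `beta=0.0` reduces the rule to plain relevance ranking, which gives a baseline for comparison.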
Total 355
195 International Conference Seong-woo Kim, Young-cheol Park, Dae Hee Youn "A variable step-size filtered-x gradient adaptive lattice algorithm for active noise control" in ICASSP, 2012
194 International Journal Jung-Won Lee, Jeung-Yoon Choi, Hong-Goo Kang "Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone" in The Journal of the Acoustical Society of America, vol.131, issue 2, 2012
193 International Conference Se-Woon Jeon, Young-cheol Park, Dae Hee Youn "Acoustic depth rendering for 3D multimedia applications" in ICCE, 2012
192 International Journal Jae-Mo Yang, Hong-Goo Kang "Two-stage source tracking method using a multiple linear regression model in the expanded phase domain" in EURASIP Journal on Advances in Signal Processing, vol.5, 2012
191 International Conference Jinkyu Lee, Soonho Baek, Hong-Goo Kang "Signal and feature domain enhancement approaches for robust speech recognition" in 8th International Conference on Information, Communications and Signal Processing, 2011
190 International Journal Min-Seok Choi, Hong-Goo Kang "Transient noise reduction in speech signal with a modified long-term predictor" in EURASIP Journal on Advances in Signal Processing, vol.141, 2011
189 International Journal Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "An Interactive 3-D Audio System With Loudspeakers" in IEEE Transactions on Multimedia, vol.13, issue 5, pp.844-855, 2011
188 International Conference Dong-il Hyun, Young-cheol Park, Seok-pil Lee, Dae Hee Youn "Enhanced Interchannel Correlation (ICC) Synthesis for Spatial Audio Coding" in AES 43rd International Conference, 2011
187 International Journal Tacksung Choi, Young-Cheol Park, Dae Hee Youn, Seokpil Lee "Virtual Sound Rendering in a Stereophonic Loudspeaker Setup" in IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue 7, pp.1962-1974, 2011
186 International Conference Se-Woon Jeon, Young-cheol Park, Seok-Pil Lee, Dae Hee Youn "Virtual Source Panning using Multiple-Wise Vector Base in the Multispeaker Stereo Format" in EUSIPCO, pp.1337-1341, 2011