Papers

Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information

International Journal

2006~2010

작성자

이진영

작성일

2010-08-01 14:25

조회

3767

Authors : Chi-Sang Jung, Moo Young Kim, Hong-Goo Kang

Year : 2010

Publisher / Conference : IEEE Transactions on Audio, Speech, and Language Processing

Volume : 18, issue 6

Page : 1332-1340

In this paper, an information theoretic approach to selecting feature frames for speaker recognition systems is proposed. A conventional approach in which the frame shift is fixed to around half of the frame length may not be the best choice, because the characteristics of the speech signal may rapidly change, especially at phonetic boundaries. Experimental results show that the recognition accuracy increases if the frame interval is directly controlled using phonetic information. By applying these results to the well-known fact that the recognition accuracy is directly correlated with the amount of mutual information, this paper suggests a novel feature frame selection method for speaker recognition. Specifically, feature frames are chosen to have minimum-redundancy within selected feature frames, but maximum-relevancy to speaker models. It is verified by experiments that the proposed method produces consistent improvement, especially in a speaker verification system. It is also robust against variations in acoustic environment.

« Performance Analysis of a Class of Single Channel Speech Enhancement Algorithms for Automatic Speech Recognition

A Two-channel Noise Estimator for Speech Enhancement in Highly Non-stationary Environment »

목록보기

전체 372

212	International Conference	Jae-Mo Yang, Hong-Goo Kang "Adaptive multichannel linear prediction based dereverberation in time-varying room environments" in EUSIPCO, 2013
211	Domestic Journal	Jae-Mo Yang, Weige Chen, Z. Zhang, Hong-Goo Kang "반향 음성 신호의 하모닉 모델링을 이용한 음질 예측 알고리즘" in 방송공학회논문지, vol.18, issue.6, pp.919-926, 2013.11
210	International Conference	JeeSok Lee, Frank Soong, Hong-Goo Kang "Source-Filter based Full-band Adaptive Harmonic Model and Its Application to Speech Prosody Modification" in INTERSPEECH, 2013
209	Domestic Conference	이태규, 백용현, 박영철, 윤대희 "스테레오-멀티채널 업믹스 시스템에서의 초기 반사음 생성 기법" in 한국방송공학회, 2013
208	International Journal	Seong-woo Kim, Young-Cheol Park, Dae Hee Youn "A variable step-size gradient adaptive lattice algorithm for multiple sinusoidal interference cancelation" in EURASIP Journal on Advances in Signal Processing, vol.106, 2013
207	International Conference	Young-Sun Joo, Chi-Sang Jung, Hong-Goo Kang "Enhancement of spectral clarity for HMM-based text-to-speech systems" in ICASSP, 2013
206	International Journal	Jung-In Lee, Jeung-Yoon Choi, Hong-Goo Kang "Refinement of Landmark Detection and Extraction of Articulator-Free Features for Knowledge-Based Speech Recognition" in IEICE Transactions on Information and Systems, vol.E96-D, No.3, pp.746-749, 2013
205	Domestic Journal	현동일, 박영철, 윤대희 "가상 음원 이미징을 위한 향상된 진폭 패닝 기법" in 전자공학회논문지, vol.50, 제 3호, pp.139-145, 2013
204	International Conference	Taegyu Lee, Seokjin Lee, Young-cheol Park, Dae Hee Youn "Virtual bass system based on a multiband harmonic generation" in ICCE, 2013
203	International Journal	Chi-Sang Jung, Young-Sun Joo, Hong-Goo Kang "Waveform Interpolation-Based Speech Analysis/Synthesis for HMM-Based TTS Systems" in IEEE Signal Processing Letters, vol.19, issue 12, pp.809-812, 2012

Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information

Previous

Sister Lab.

Yonsei University

Academic Website