Papers

Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone speech

International Journal
2011~2015
Posted by: 이진영
Date posted: 2012-02-01 14:59
Authors : Jung-Won Lee, Jeung-Yoon Choi, Hong-Goo Kang

Year : 2012

Publisher / Conference : The Journal of the Acoustical Society of America

Volume : 131, issue 2

Knowledge-based speech recognition systems extract acoustic cues from the signal to identify speech characteristics. For channel-deteriorated telephone speech, acoustic cues, especially those for stop consonant place, are expected to be degraded or absent. To investigate the use of knowledge-based methods in degraded environments, feature extrapolation of acoustic-phonetic features based on Gaussian mixture models is examined. This process is applied to a stop place detection module that uses burst release and vowel onset cues for consonant-vowel tokens of English. Results show that classification performance is enhanced in telephone channel-degraded speech, with extrapolated acoustic-phonetic features reaching or exceeding performance using estimated Mel-frequency cepstral coefficients (MFCCs). Results also show acoustic-phonetic features may be combined with MFCCs for best performance, suggesting these features provide information complementary to MFCCs.
Total: 355
195 International Conference Seong-woo Kim, Young-cheol Park, Dae Hee Youn "A variable step-size filtered-x gradient adaptive lattice algorithm for active noise control" in ICASSP, 2012
194 International Journal Jung-Won Lee, Jeung-Yoon Choi, Hong-Goo Kang "Classification of stop place in consonant-vowel contexts using feature extrapolation of acoustic-phonetic features in telephone speech" in The Journal of the Acoustical Society of America, vol.131, issue 2, 2012
193 International Conference Se-Woon Jeon, Young-cheol Park, Dae Hee Youn "Acoustic depth rendering for 3D multimedia applications" in ICCE, 2012
192 International Journal Jae-Mo Yang, Hong-Goo Kang "Two-stage source tracking method using a multiple linear regression model in the expanded phase domain" in EURASIP Journal on Advances in Signal Processing, vol.5, 2012
191 International Conference Jinkyu Lee, Soonho Baek, Hong-Goo Kang "Signal and feature domain enhancement approaches for robust speech recognition" in 8th International Conference on Information, Communications and Signal Processing, 2011
190 International Journal Min-Seok Choi, Hong-Goo Kang "Transient noise reduction in speech signal with a modified long-term predictor" in EURASIP Journal on Advances in Signal Processing, vol.141, 2011
189 International Journal Myung-Suk Song, Cha Zhang, Dinei Florencio, Hong-Goo Kang "An Interactive 3-D Audio System With Loudspeakers" in IEEE Transactions on Multimedia, vol.13, issue 5, pp.844-855, 2011
188 International Conference Dong-il Hyun, Young-cheol Park, Seok-pil Lee, Dae Hee Youn "Enhanced Interchannel Correlation (ICC) Synthesis for Spatial Audio Coding" in AES 43rd International Conference, 2011
187 International Journal Tacksung Choi, Young-Cheol Park, Dae Hee Youn, Seokpil Lee "Virtual Sound Rendering in a Stereophonic Loudspeaker Setup" in IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue 7, pp.1962-1974, 2011
186 International Conference Se-Woon Jeon, Young-cheol Park, Seok-Pil Lee, Dae Hee Youn "Virtual Source Panning using Multiple-Wise Vector Base in the Multispeaker Stereo Format" in EUSIPCO, pp.1337-1341, 2011