Papers

Vowel place detection for a knowledge‐based speech recognition system

International Journal
2006~2010
작성자
한혜원
작성일
2008-06-01 23:30
조회
99
Authors : Sukmyung Lee, Jeung-Yoon Choi

Year : 2008

Publisher / Conference : The Journal of the Acoustical Society of America

Volume : 123, issue 5

This work aims to detect vowel place as part of a knowledge‐based speech recognition system. Vowel place was classified into 6 groups based on tongue advancement [Front/Back] and height [High/Mid/Low]. Experiments were performed using 300 /hVd/ utterance data from Hillenbrand [J. Acoust. Soc. Am. 97, 3099‐3111] and 6600 TIMIT vowels. Features used include fundamental frequency (F0) and formant value (F1̃F3), where formant measurements were classified into separate groups using F0 measurements. The nearest class was found using a simple Mahalanobis distance measure, and yielded a 91.5% classification rate for the /hVd/ data. The results for the TIMIT data were 64.4%, and error analysis with regard to adjacent segment manner and place was carried out to observe the effects of coarticulation, which was not observed in the /hVd/ data.
전체 319
319 International Conference Jinyoung Lee and Hong-Goo Kang "Stacked U-Net with High-level Feature Transfer for Parameter Efficient Speech Enhancement" in APSIPA ASC, 2021
318 International Conference Huu-Kim Nguyen, Kihyuk Jeong, Se-Yun Um, Min-Jae Hwang, Eunwoo Song, Hong-Goo Kang "LiteTTS: A Decoder-free Light-weight Text-to-wave Synthesis Based on Generative Adversarial Networks" in INTERSPEECH, 2021
317 International Conference Zainab Alhakeem, Yoohwan Kwon, Hong-Goo Kang "Disentangled Representations for Arabic Dialect Identification based on Supervised Clustering with Triplet Loss" in EUSIPCO, 2021
316 International Conference Miseul Kim, Minh-Tri Ho, Hong-Goo Kang "Self-supervised Complex Network for Machine Sound Anomaly Detection" in EUSIPCO, 2021
315 International Conference Kihyuk Jeong, Huu-Kim Nguyen, Hong-Goo Kang "A Fast and Lightweight Text-To-Speech Model with Spectrum and Waveform Alignment Algorithms" in EUSIPCO, 2021
314 International Conference Jiyoung Lee*, Soo-Whan Chung*, Sunok Kim, Hong-Goo Kang**, Kwanghoon Sohn** "Looking into Your Speech: Learning Cross-modal Affinity for Audio-visual Speech Separation" in CVPR, 2021
313 International Conference Zainab Alhakeem, Hong-Goo Kang "Confidence Learning from Noisy Labels for Arabic Dialect Identification" in ITC-CSCC, 2021
312 International Conference Huu-Kim Nguyen, Kihyuk Jeong, Hong-Goo Kang "Fast and Lightweight Speech Synthesis Model based on FastSpeech2" in ITC-CSCC, 2021
311 International Conference Yoohwan Kwon*, Hee-Soo Heo*, Bong-Jin Lee, Joon Son Chung "The ins and outs of speaker recognition: lessons from VoxSRC 2020" in ICASSP, 2021
310 International Conference You Jin Kim, Hee Soo Heo, Soo-Whan Chung, Bong-Jin Lee "End-to-end Lip Synchronisation Based on Pattern Classification" in IEEE Spoken Language Technology Workshop (SLT), 2020