Papers

Consideration of Varying Training Lengths for Short-Duration Speaker Verification

International Conference
작성자
dsp
작성일
2023-09-12 14:23
조회
409
Authors : WooSeok Ko, Seyun Um, Zhenyu Piao, Hong-goo Kang

Year : 2023

Publisher / Conference : APSIPA ASC

Research area : Speech Signal Processing, Speaker Recognition

Presentation : Poster

We present an efficient training scheme for speaker verification (SV) networks in short-duration speech input scenarios. We analyze the effects of varying training lengths on SV performance, with a particular focus on short utterances. Despite the high demand for short-duration SV in real-world applications, state-of-the-art SV systems have primarily been evaluated on long utterances, and little research has been conducted on shortduration SV. By considering the innate characteristics of SV architectures and the performance discrepancies associated with varying training data lengths, we propose a training scheme that accounts for varying length conditions. We categorize speaker characteristics as coarse-grained and fine-grained features and demonstrate that training models to learn both features can result in length-robust speaker embeddings. Our proposed training scheme improves model performance by 28.7% and 37.9% in terms of equal error rate on short-duration speech scenarios compared to baseline models.
전체 355
1 International Conference WooSeok Ko, Seyun Um, Zhenyu Piao, Hong-goo Kang "Consideration of Varying Training Lengths for Short-Duration Speaker Verification" in APSIPA ASC, 2023