Papers

Consideration of Varying Training Lengths for Short-Duration Speaker Verification

International Conference

작성자

dsp

작성일

2023-09-12 14:23

조회

2506

Authors : WooSeok Ko, Seyun Um, Zhenyu Piao, Hong-goo Kang

Year : 2023

Publisher / Conference : APSIPA ASC

Research area : Speech Signal Processing, Speaker Recognition

Presentation : Poster

We present an efficient training scheme for speaker verification (SV) networks in short-duration speech input scenarios. We analyze the effects of varying training lengths on SV performance, with a particular focus on short utterances. Despite the high demand for short-duration SV in real-world applications, state-of-the-art SV systems have primarily been evaluated on long utterances, and little research has been conducted on shortduration SV. By considering the innate characteristics of SV architectures and the performance discrepancies associated with varying training data lengths, we propose a training scheme that accounts for varying length conditions. We categorize speaker characteristics as coarse-grained and fine-grained features and demonstrate that training models to learn both features can result in length-robust speaker embeddings. Our proposed training scheme improves model performance by 28.7% and 37.9% in terms of equal error rate on short-duration speech scenarios compared to baseline models.

« SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems

PDF-NET: Pitch-adaptive Dynamic Filter Network for Intra-gender Speaker Verification »

목록보기

전체 372

International Conference

WooSeok Ko, Seyun Um, Zhenyu Piao, Hong-goo Kang "Consideration of Varying Training Lengths for Short-Duration Speaker Verification" in APSIPA ASC, 2023

Consideration of Varying Training Lengths for Short-Duration Speaker Verification

Previous

Sister Lab.

Yonsei University

Academic Website