Papers

SC-ERM: Speaker-Centric Learning for Speech Emotion Recognition

International Conference
2021~
작성자
dsp
작성일
2024-01-22 16:12
조회
237
Authors : Juhwan Yoon, Seyun Um, Woo-Jin Chung, Hong-Goo Kang

Year : 2024

Publisher / Conference : International Conference on Electronics, Information, and Communication (ICEIC)

Research area : Speech Signal Processing, Etc

Presentation/Publication date : 2024.01.29

Presentation : Poster

We propose a novel deep learning-based model for speech emotion recognition, SC-ERM, which focuses on speakercentric learning. This model effectively estimates emotions and demonstrates the ability to generalize to unseen speakers. Our proposed model utilizes speaker-specific emotion characteristics in two steps: first, it extracts emotion representations using an emotion encoder, and second, it employs speaker-centric learning by incorporating speaker style embeddings as a condition through a speaker mask generator. We evaluate our model’s performance using an emotional dataset and find that it demonstrates outstanding performance in recognizing emotional states. Notably, it achieves a 9.2% relative improvement in accuracy compared to the baseline when classifying emotions for speakers not seen during training. Overall, our model demonstrates promising performance in accurately identifying emotions across a range of emotional expressions, irrespective of the speakers involved.
전체 355
7 International Conference Yeona Hong, Miseul Kim, Woo-Jin Chung, Hong-Goo Kang "Contextual Learning for Missing Speech Automatic Speech Recognition" in International Conference on Electronics, Information, and Communication (ICEIC), 2024
6 International Conference Zhenyu Piao, Hyungseob Lim, Miseul Kim, Hong-goo Kang "PDF-NET: Pitch-adaptive Dynamic Filter Network for Intra-gender Speaker Verification" in APSIPA ASC, 2023
5 International Conference Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang "BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0" in The IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), 2023
4 International Conference Zhenyu Piao, Miseul Kim, Hyungchan Yoon, Hong-Goo Kang "HappyQuokka System for ICASSP 2023 Auditory EEG Challenge" in ICASSP, 2023
3 International Conference Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang "Style Modeling for Multi-Speaker Articulation-to-Speech" in ICASSP, 2023
2 International Conference Miseul Kim, Zhenyu Piao, Seyun Um, Ran Lee, Jaemin Joh, Seungshin Lee, Hong-Goo Kang "Light-Weight Speaker Verification with Global Context Information" in INTERSPEECH, 2022
1 International Conference Miseul Kim, Minh-Tri Ho, Hong-Goo Kang "Self-supervised Complex Network for Machine Sound Anomaly Detection" in EUSIPCO, 2021