Papers

Fast and Lightweight Speech Synthesis Model based on FastSpeech2

International Conference
2021~
작성자
한혜원
작성일
2021-06-28 10:28
조회
647
Authors : Huu-Kim Nguyen, Kihyuk Jeong, Hong-Goo Kang

Year : 2021

Publisher / Conference : ITC-CSCC

Research area : Speech Signal Processing, Text-to-Speech

Presentation : 구두

In this paper, we present a fast and lightweight speech synthesis model that is suitable for on-device applications. By leveraging the techniques of long-short range attention, depth-wise separable convolution, and linear attention, we significantly reduce the model size and complexity of the baseline FastSpeech2-based Transformer framework. Unlike the baseline model that requires O(N^2) to compute attention and convolution operations because of nested-loop computations, our proposed model only requires O(N) computations due to the modification of a nested-loop into two cascaded single loops.
전체 329
2 International Conference Huu-Kim Nguyen, Kihyuk Jeong, Hong-Goo Kang "Fast and Lightweight Speech Synthesis Model based on FastSpeech2" in ITC-CSCC, 2021
1 International Conference Yoohwan Kwon*, Hee-Soo Heo*, Bong-Jin Lee, Joon Son Chung "The ins and outs of speaker recognition: lessons from VoxSRC 2020" in ICASSP, 2021