304 |
Domestic Journal
k-평균 알고리즘을 활용한 음성의 대표 감정 스타일 결정 방법
|
2019-12-17 |
In this paper, we propose a method to effectively determine the representative style embedding of each emotion class to improve the global style token-based end-to-end speech synthesis system. The emot...
|
303 |
International Conference
A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis
|
2019-11-25 |
In this paper, we investigate the variation in the
performance of a deep learning-based speech synthesis (DLSS)
system based on the configuration of output acoustic parameters.
Our method is mainly applicable...
|
302 |
International Journal
An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis
|
2019-08-10 |
In this letter, we propose a high-quality emotional speech synthesis system, using emotional vector space, i.e., the weighted sum of global style tokens (GSTs). Our previous research verified the feasibilit...
|
301 |
International Journal
Dry Electrode-Based Body Fat Estimation System with Anthropometric Data for Use in a Wearable Device
|
2019-07-18 |
The bioelectrical impedance analysis (BIA) method is widely used to predict percent bodyfat (PBF). However, it requires four to eight electrodes, and it takes a few minutes to accuratelyobtain the mea...
|
300 |
International Conference
Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization
|
2019-06-18 |
In this paper, we propose a wind noise reduction method based on various types of non-negative matrix factorization (NMF) approaches. Since wind noise has highly non- stationary spectral characterist...
|
299 |
International Conference
Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework
|
2019-06-18 |
In this paper, we propose a speech synthesis system that effectively generates multiple types of emotional speech using the concept of global style token (GST); where the emotion-related style informati...
|
298 |
International Conference
Excitation-by-SampleRNN Model for Text-to-Speech
|
2019-06-18 |
.In this paper, we propose a neural vocoder-based textto-speech (TTS) system that effectively utilizes a source-filter modeling framework. Although neural vocoder algorithms such as SampleRNN and WaveNet ar...
|
297 |
International Conference
Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment
|
2019-06-18 |
In this paper, we propose a deep learning (DL)-based parameter enhancement method for a mixed excitation linear prediction (MELP) speech codec in noisy communication environment.Unlike conventional...
|
296 |
International Journal
A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems
|
2019-05-02 |
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...
|
295 |
International Conference
Gradient-based active learning query strategy for end-to-end speech recognition
|
2019-02-07 |
In this paper, we propose an effective active learning query strategy for an automatic speech recognition system with the aim of reducing the training cost. Generally, training a deep neural network ...
|
294 |
International Conference
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
|
2019-02-07 |
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...
|
293 |
International Conference
Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems
|
2019-01-24 |
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...
|
292 |
International Journal
A Priori SNR Estimation Using Air- and Bone-Conduction Microphones
|
2019-01-24 |
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...
|
291 |
International Conference
A Deep Learning-based Stress Detection Algorithm with Speech Signal
|
2018-09-27 |
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...
|
290 |
Domestic Conference
비학습 데이터 적응화 기법을 이용한 딥러닝 기반 한국어 음성 인식 기술
|
2018-08-25 |
제35회 음성통신 및 신호처리 학술대회
|
289 |
Domestic Conference
임베딩 매트릭스를 기반으로 한 비정상적 잡음 제거 알고리즘의 분석과 딥러닝 음질개선 방법들과의 성능비교
|
2018-08-25 |
제35회 음성통신 및 신호처리 학술대회
|
288 |
International Journal
Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement
|
2018-07-13 |
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...
|
287 |
International Conference
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems
|
2018-06-04 |
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...
|
286 |
International Conference
Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement
|
2018-03-23 |
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...
|
285 |
International Journal
Generic Uniform Search Grid Generation Algorithm for Far-field Source Localization
1
|
2018-02-05 |
In this letter, a generic search grid generation algorithm for far-field source localization (SL) is proposed. Since conventional uniform regular grid structures only consider the resolution of the d...
|