No.
122 International Conference A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis 2019-11-25
In this paper, we investigate the variation in the performance of a deep learning-based speech synthesis (DLSS) system based on the configuration of output acoustic parameters. Our method is mainly applicable...  
121 International Journal An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis 2019-08-10
In this letter, we propose a high-quality emotional speech synthesis system, using emotional vector space, i.e., the weighted sum of global style tokens (GSTs). Our previous research verified the feasibilit...  
120 International Journal Dry Electrode-Based Body Fat Estimation System with Anthropometric Data for Use in a Wearable Device 2019-07-18
The bioelectrical impedance analysis (BIA) method is widely used to predict percent body fat (PBF). However, it requires four to eight electrodes, and it takes a few minutes to accurately obtain the mea...  
119 International Conference Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization 2019-06-18
In this paper, we propose a wind noise reduction method based on various types of non-negative matrix factorization (NMF) approaches. Since wind noise has highly non-stationary spectral characterist...  
118 International Conference Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework 2019-06-18
In this paper, we propose a speech synthesis system that effectively generates multiple types of emotional speech using the concept of global style token (GST), where the emotion-related style informati...  
117 International Conference Excitation-by-SampleRNN Model for Text-to-Speech 2019-06-18
In this paper, we propose a neural vocoder-based text-to-speech (TTS) system that effectively utilizes a source-filter modeling framework. Although neural vocoder algorithms such as SampleRNN and WaveNet ar...  
116 International Conference Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment 2019-06-18
In this paper, we propose a deep learning (DL)-based parameter enhancement method for a mixed excitation linear prediction (MELP) speech codec in noisy communication environment. Unlike conventional...  
115 International Journal A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems 2019-05-02
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...  
114 International Journal A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems 2019-05-02
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...  
113 International Conference Gradient-based active learning query strategy for end-to-end speech recognition 2019-02-07
In this paper, we propose an effective active learning query strategy for an automatic speech recognition system with the aim of reducing the training cost. Generally, training a deep neural network ...  
112 International Conference Perfect match: Improved cross-modal embeddings for audio-visual synchronisation 2019-02-07
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...  
111 International Conference Perfect match: Improved cross-modal embeddings for audio-visual synchronisation 2019-02-07
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...  
110 International Conference Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems 2019-01-24
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...  
109 International Journal A Priori SNR Estimation Using Air- and Bone-Conduction Microphones 2019-01-24
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...  
108 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals. With increasing demands for communication between human and intelligent systems, automatic stress de...  
107 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals. With increasing demands for communication between human and intelligent systems, automatic stress de...  
106 International Journal Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement 2018-07-13
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...  
105 International Journal Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement 2018-07-13
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...  
104 International Conference A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems 2018-06-04
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...  
103 International Conference A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems 2018-06-04
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...