No.
114 International Journal A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems 2019-05-02
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...  
113 International Conference Gradient-based active learning query strategy for end-to-end speech recognition 2019-02-07
In this paper, we propose an effective active learning query strategy for an automatic speech recognition system with the aim of reducing the training cost. Generally, training a deep neural network ...  
112 International Conference Perfect match: Improved cross-modal embeddings for audio-visual synchronisation 2019-02-07
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...  
111 International Conference Perfect match: Improved cross-modal embeddings for audio-visual synchronisation 2019-02-07
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...  
110 International Conference Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems 2019-01-24
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...  
109 International Journal A Priori SNR Estimation Using Air- and Bone-Conduction Microphones 2019-01-24
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...  
108 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals. With increasing demands for communication between humans and intelligent systems, automatic stress de...  
107 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals. With increasing demands for communication between humans and intelligent systems, automatic stress de...  
106 International Journal Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement 2018-07-13
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...  
105 International Journal Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement 2018-07-13
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...  
104 International Conference A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems 2018-06-04
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...  
103 International Conference A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems 2018-06-04
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...  
102 International Conference Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement 2018-03-23
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...  
101 International Conference Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement 2018-03-23
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...  
100 International Journal Generic Uniform Search Grid Generation Algorithm for Far-field Source Localization 2018-02-05
In this letter, a generic search grid generation algorithm for far-field source localization (SL) is proposed. Since conventional uniform regular grid structures only consider the resolution of the d...  
99 International Conference Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System 2018-02-05
This paper proposes a novel noise compensation algorithm for a glottal excitation model in a deep learning (DL)-based speech synthesis system. To generate high-quality speech synthesis outputs, the bala...  
98 International Conference DNN-based Wireless Positioning in An Outdoor Environment 2018-02-05
In this paper, we propose a deep learning-based algorithm to estimate the position of a user by utilizing reference signal received power (RSRP) and the location of base stations. To obtain relia...  
97 International Conference Two Electrode based Healthcare Device for Continuously Monitoring ECG and BIA Signals 2017-12-18
In this study, we propose an effective wearable device to continuously and reliably monitor electrocardiogram (ECG) and body impedance (BI), which are the two most important bio-signals for healthcare applicat...  
96 International Journal Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems 2017-10-13
In this paper, we report research results on modeling the parameters of an improved time-frequency trajectory excitation (ITFTE) and spectral envelopes of an LPC vocoder with a long short-term memory...  
95 International Journal Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems 2017-10-13
In this paper, we report research results on modeling the parameters of an improved time-frequency trajectory excitation (ITFTE) and spectral envelopes of an LPC vocoder with a long short-term memory...