번호
111 International Conference Perfect match: Improved cross-modal embeddings for audio-visual synchronisation 2019-02-07
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...  
110 International Conference Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems 2019-01-24
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...  
109 International Journal A Priori SNR Estimation Using Air- and Bone-Conduction Microphones 2019-01-24
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...  
108 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...  
107 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...  
106 International Journal Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement 2018-07-13
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...  
105 International Journal Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement 2018-07-13
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...  
104 International Conference A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems 2018-06-04
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...  
103 International Conference A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems 2018-06-04
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...  
102 International Conference Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement 2018-03-23
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...  
101 International Conference Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement 2018-03-23
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...  
100 International Journal Generic Uniform Search Grid Generation Algorithm for Far-field Source Localization 1 2018-02-05
In this letter, a generic search grid generation algorithm for far-field source localization (SL) is proposed. Since conventional uniform regular grid structures only consider the resolution of the d...  
99 International Conference Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System 2018-02-05
This paper proposes a novel noise compensation algorithm for a glottal excitation model in a deep learning (DL)-based speech synthesis system. To generate high-quality speech synthesis outputs, the bala...  
98 International Conference DNN-based Wireless Positioning in An Outdoor Environment 2018-02-05
In this paper, we propose a deep learning based algorithm to estimate the position of an user by utilizing reference signal received power (RSRP) and the location of base stations. To obtain relia...  
97 International Conference Two Electrode based Healthcare Device for Continuously Monitoring ECG and BIA Signals 2017-12-18
In this study, we propose an effective wearable device to continuously and reliably monitor electrocardiogram (ECG) and body impedance (BI) that are two most important bio-signals for healthcare applicat...  
96 International Journal Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems 2017-10-13
In this paper, we report research results on modeling the parameters of an improved time-frequency trajectory excitation (ITFTE) and spectral envelopes of an LPC vocoder with a long short-term memory...  
95 International Journal Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems 2017-10-13
In this paper, we report research results on modeling the parameters of an improved time-frequency trajectory excitation (ITFTE) and spectral envelopes of an LPC vocoder with a long short-term memory...  
94 International Journal SVD Based Adaptive QIM Watermarking on Stereo Audio Signals 2017-07-31
This paper proposes a blind digital audio watermarking algorithm that utilizes the quantization index modulation (QIM) and the singular value decomposition (SVD) of stereo audio signals. Conventional SVD-ba...  
93 International Conference Continuous Bladder Volume Monitoring System for Wearable Applications 2017-07-17
<style type="text/css"> p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 9.0px Helvetica} </style> In this research, we propose a bladder volume monitoring system that can be effectively applied for various v...  
92 International Conference Deep bi-directional long short-term memory based speech enhancement for wind noise reduction 2017-03-17
In this paper, we propose a new recurrent neural network (RNN)-based single-channel speech enhancement framework for off-line wind noise reduction. To adequately represent highly non-stationary charact...