116 |
International Conference
Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment
|
2019-06-18 |
In this paper, we propose a deep learning (DL)-based parameter enhancement method for a mixed excitation linear prediction (MELP) speech codec in noisy communication environment.Unlike conventional...
|
115 |
International Journal
A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems
|
2019-05-02 |
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...
|
114 |
International Journal
A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems
|
2019-05-02 |
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...
|
113 |
International Conference
Gradient-based active learning query strategy for end-to-end speech recognition
|
2019-02-07 |
In this paper, we propose an effective active learning query strategy for an automatic speech recognition system with the aim of reducing the training cost. Generally, training a deep neural network ...
|
112 |
International Conference
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
|
2019-02-07 |
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...
|
111 |
International Conference
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
|
2019-02-07 |
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...
|
110 |
International Conference
Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems
|
2019-01-24 |
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...
|
109 |
International Journal
A Priori SNR Estimation Using Air- and Bone-Conduction Microphones
|
2019-01-24 |
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...
|
108 |
International Conference
A Deep Learning-based Stress Detection Algorithm with Speech Signal
|
2018-09-27 |
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...
|
107 |
International Conference
A Deep Learning-based Stress Detection Algorithm with Speech Signal
|
2018-09-27 |
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...
|
106 |
International Journal
Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement
|
2018-07-13 |
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...
|
105 |
International Journal
Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement
|
2018-07-13 |
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...
|
104 |
International Conference
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems
|
2018-06-04 |
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...
|
103 |
International Conference
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems
|
2018-06-04 |
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...
|
102 |
International Conference
Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement
|
2018-03-23 |
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...
|
101 |
International Conference
Deep Learning-Based Speech Presence Probability Estimation for Noise PSD Estimation in Single-Channel Speech Enhancement
|
2018-03-23 |
In single-channel speech enhancement, it is essential to determine noise reduction factors to successfully remove noise while minimizing speech distortion. These factors are typically set by a ...
|
100 |
International Journal
Generic Uniform Search Grid Generation Algorithm for Far-field Source Localization
1
|
2018-02-05 |
In this letter, a generic search grid generation algorithm for far-field source localization (SL) is proposed. Since conventional uniform regular grid structures only consider the resolution of the d...
|
99 |
International Conference
Modeling-by-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System
|
2018-02-05 |
This paper proposes a novel noise compensation algorithm for a glottal excitation model in a deep learning (DL)-based speech synthesis system. To generate high-quality speech synthesis outputs, the bala...
|
98 |
International Conference
DNN-based Wireless Positioning in An Outdoor Environment
|
2018-02-05 |
In this paper, we propose a deep learning based algorithm to estimate the position of an user by utilizing reference signal received power (RSRP) and the location of base stations. To obtain relia...
|
97 |
International Conference
Two Electrode based Healthcare Device for Continuously Monitoring ECG and BIA Signals
|
2017-12-18 |
In this study, we propose an effective wearable device to continuously and reliably monitor electrocardiogram (ECG) and body impedance (BI) that are two most important bio-signals for healthcare applicat...
|