306 |
Domestic Conference
저사양 TV 사운드 설계환경을 위한 IIR 필터 기반 주파수 등화기
|
2020-04-01 |
In countries that are developing low-end TVs (eg
India, Africa, etc.), the lack of development
environment and infrastructure often do not take
into account the sound environment of the TV. To
solve ...
|
305 |
International Conference
Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network
|
2020-01-31 |
In this paper, we propose an improved LPCNet vocoder using a linear prediction (LP)-structured mixture density network (MDN).The recently proposed LPCNet vocoder has successfully achieved high-quality ...
|
304 |
Domestic Journal
k-평균 알고리즘을 활용한 음성의 대표 감정 스타일 결정 방법
|
2019-12-17 |
In this paper, we propose a method to effectively determine the representative style embedding of each emotion class to improve the global style token-based end-to-end speech synthesis system. The emot...
|
303 |
International Conference
A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis
|
2019-11-25 |
In this paper, we investigate the variation in the
performance of a deep learning-based speech synthesis (DLSS)
system based on the configuration of output acoustic parameters.
Our method is mainly applicable...
|
302 |
International Journal
An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis
|
2019-08-10 |
In this letter, we propose a high-quality emotional speech synthesis system, using emotional vector space, i.e., the weighted sum of global style tokens (GSTs). Our previous research verified the feasibilit...
|
301 |
International Journal
Dry Electrode-Based Body Fat Estimation System with Anthropometric Data for Use in a Wearable Device
|
2019-07-18 |
The bioelectrical impedance analysis (BIA) method is widely used to predict percent bodyfat (PBF). However, it requires four to eight electrodes, and it takes a few minutes to accuratelyobtain the mea...
|
300 |
International Conference
Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization
|
2019-06-18 |
In this paper, we propose a wind noise reduction method based on various types of non-negative matrix factorization (NMF) approaches. Since wind noise has highly non- stationary spectral characterist...
|
299 |
International Conference
Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework
|
2019-06-18 |
In this paper, we propose a speech synthesis system that effectively generates multiple types of emotional speech using the concept of global style token (GST); where the emotion-related style informati...
|
298 |
International Conference
Excitation-by-SampleRNN Model for Text-to-Speech
|
2019-06-18 |
.In this paper, we propose a neural vocoder-based textto-speech (TTS) system that effectively utilizes a source-filter modeling framework. Although neural vocoder algorithms such as SampleRNN and WaveNet ar...
|
297 |
International Conference
Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment
|
2019-06-18 |
In this paper, we propose a deep learning (DL)-based parameter enhancement method for a mixed excitation linear prediction (MELP) speech codec in noisy communication environment.Unlike conventional...
|
296 |
International Journal
A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems
|
2019-05-02 |
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...
|
295 |
International Conference
Gradient-based active learning query strategy for end-to-end speech recognition
|
2019-02-07 |
In this paper, we propose an effective active learning query strategy for an automatic speech recognition system with the aim of reducing the training cost. Generally, training a deep neural network ...
|
294 |
International Conference
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation
|
2019-02-07 |
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...
|
293 |
International Conference
Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems
|
2019-01-24 |
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...
|
292 |
International Journal
A Priori SNR Estimation Using Air- and Bone-Conduction Microphones
|
2019-01-24 |
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...
|
291 |
International Conference
A Deep Learning-based Stress Detection Algorithm with Speech Signal
|
2018-09-27 |
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...
|
290 |
Domestic Conference
비학습 데이터 적응화 기법을 이용한 딥러닝 기반 한국어 음성 인식 기술
|
2018-08-25 |
제35회 음성통신 및 신호처리 학술대회
|
289 |
Domestic Conference
임베딩 매트릭스를 기반으로 한 비정상적 잡음 제거 알고리즘의 분석과 딥러닝 음질개선 방법들과의 성능비교
|
2018-08-25 |
제35회 음성통신 및 신호처리 학술대회
|
288 |
International Journal
Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement
|
2018-07-13 |
This letter presents a phase-sensitive joint learning algorithm for single-channel speech enhancement. Although a deep learning framework that estimates the time-frequency (T-F) domain ideal ratio masks demo...
|
287 |
International Conference
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems
|
2018-06-04 |
In this paper, we propose a unified training framework for the generation of glottal signals in deep learning (DL)-based parametric speech synthesis systems. The glottal vocoding-based speech synthesis syst...
|