번호
309 Domestic Conference 메타러닝을 이용한 SAR 영상 자동표적 인식 2020-07-13
공군의 공대지 작전에서 지상의 물체를 정확하게 식별하는 것은 매우 중요하다. 그러나, 임무 특성상 대부분 높은 고도에서 임무를 수행하기 때문에 조종사가 육안으로 표적을 정확하게 식별하는 것은 어렵고, 구름이나 안개와 같...  
308 International Journal Perfect Match: Self-Supervised Embeddings for Cross-modal Retrieval 2020-05-25
Abstract : This paper proposes a new strategy for learning effective cross-modal joint embeddings using self-supervision. We set up the problem as one of cross-modal retrieval, where the objective is to ...  
307 International Conference Emotional Speech Synthesis with Rich and Granularized Control 2020-04-19
This paper proposes an effective emotion control method for an end-to-end text-to-speech (TTS) system. To flexibly control the distinct characteristic of a target emotion category, it is essential to ...  
306 Domestic Conference 저사양 TV 사운드 설계환경을 위한 IIR 필터 기반 주파수 등화기 2020-04-01
In countries that are developing low-end TVs (eg India, Africa, etc.), the lack of development environment and infrastructure often do not take into account the sound environment of the TV. To solve ...  
305 International Conference Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network 2020-01-31
In this paper, we propose an improved LPCNet vocoder using a linear prediction (LP)-structured mixture density network (MDN).The recently proposed LPCNet vocoder has successfully achieved high-quality ...  
304 Domestic Journal k-평균 알고리즘을 활용한 음성의 대표 감정 스타일 결정 방법 2019-12-17
In this paper, we propose a method to effectively determine the representative style embedding of each emotion class to improve the global style token-based end-to-end speech synthesis system. The emot...  
303 International Conference A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis 2019-11-25
In this paper, we investigate the variation in the performance of a deep learning-based speech synthesis (DLSS) system based on the configuration of output acoustic parameters. Our method is mainly applicable...  
302 International Journal An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis 2019-08-10
In this letter, we propose a high-quality emotional speech synthesis system, using emotional vector space, i.e., the weighted sum of global style tokens (GSTs). Our previous research verified the feasibilit...  
301 International Journal Dry Electrode-Based Body Fat Estimation System with Anthropometric Data for Use in a Wearable Device 2019-07-18
The bioelectrical impedance analysis (BIA) method is widely used to predict percent bodyfat (PBF). However, it requires four to eight electrodes, and it takes a few minutes to accuratelyobtain the mea...  
300 International Conference Model Order Selection for Wind Noise Reduction in Non-negative Matrix Factorization 2019-06-18
In this paper, we propose a wind noise reduction method based on various types of non-negative matrix factorization (NMF) approaches. Since wind noise has highly non- stationary spectral characterist...  
299 International Conference Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework 2019-06-18
In this paper, we propose a speech synthesis system that effectively generates multiple types of emotional speech using the concept of global style token (GST); where the emotion-related style informati...  
298 International Conference Excitation-by-SampleRNN Model for Text-to-Speech 2019-06-18
.In this paper, we propose a neural vocoder-based textto-speech (TTS) system that effectively utilizes a source-filter modeling framework. Although neural vocoder algorithms such as SampleRNN and WaveNet ar...  
297 International Conference Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment 2019-06-18
In this paper, we propose a deep learning (DL)-based parameter enhancement method for a mixed excitation linear prediction (MELP) speech codec in noisy communication environment.Unlike conventional...  
296 International Journal A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems 2019-05-02
This paper presents a joint learning algorithm for complex-valued time-frequency (T-F) masks in single-channel speech enhancement systems. Most speech enhancement algorithms operating in a single-channel micro...  
295 International Conference Gradient-based active learning query strategy for end-to-end speech recognition 2019-02-07
In this paper, we propose an effective active learning query strategy for an automatic speech recognition system with the aim of reducing the training cost. Generally, training a deep neural network ...  
294 International Conference Perfect match: Improved cross-modal embeddings for audio-visual synchronisation 2019-02-07
This paper proposes a new strategy for learning powerful cross-modal embeddings for audio-to-video synchronization. Here, we set up the problem as one of cross-modal retrieval, where the objective is to ...  
293 International Conference Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems 2019-01-24
This paper investigates how the perceptual quality of the synthesized speech is affected by reconstruction errors in excitation signals generated by a deep learning-based statistical model. In this fram...  
292 International Journal A Priori SNR Estimation Using Air- and Bone-Conduction Microphones 2019-01-24
This paper proposes an a priori signal-to-noise ratio (SNR) estimator using an air-conduction (AC) and a bone-conduction (BC) microphone. Among various ways of combining AC and BC microphones for spee...  
291 International Conference A Deep Learning-based Stress Detection Algorithm with Speech Signal 2018-09-27
In this paper, we propose a deep learning-based psychological stress detection algorithm using speech signals.With increasing demands for communication between human and intelligent systems, automatic stress de...  
290 Domestic Conference 비학습 데이터 적응화 기법을 이용한 딥러닝 기반 한국어 음성 인식 기술 2018-08-25
제35회 음성통신 및 신호처리 학술대회