Papers

Selection of spectral compressive operator for vector Taylor series-based model adaptation in noisy environments

International Journal
2011~2015
Posted by: 이진영
Posted on: 2014-05-01 22:02
Authors : Soonho Baek, Hong-Goo Kang

Year : 2014

Publisher / Conference : The Journal of the Acoustical Society of America

Volume : 135

This letter investigates the impact of spectral compression on the vector Taylor series-based model adaptation algorithm. Unlike mel-frequency cepstral coefficients, which are obtained by logarithmic compression, the features here are extracted using fractional power compression. Since the relationship between the acoustic models for clean and noisy speech depends on the nonlinearity of the spectrum, it is important to select an appropriate compressive operator for model adaptation. In this letter, the dependency of the speech recognition system on spectral nonlinearity is analyzed in various noisy environments. Experimental results confirm that replacing the compressive operator improves the performance of the model adaptation.
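The difference between the two compressive operators can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes noise that is purely additive in the filterbank-energy domain, with channel distortion and the speech-noise cross term ignored. The function names and the exponent `alpha = 0.1` are hypothetical choices for illustration, since the abstract does not state the exponent values that were studied.

```python
import numpy as np


def log_compress(energies, floor=1e-10):
    """Logarithmic compression, as used for conventional MFCC-style features."""
    return np.log(np.maximum(energies, floor))


def power_compress(energies, alpha=0.1):
    """Fractional power compression; alpha is a hypothetical illustrative value."""
    return np.power(energies, alpha)


def noisy_from_clean_log(x, n):
    """Clean-to-noisy relation in the log domain: y = x + log(1 + exp(n - x))."""
    return x + np.log1p(np.exp(n - x))


def noisy_from_clean_power(x, n, alpha=0.1):
    """Clean-to-noisy relation in the power domain:
    y = (x^(1/alpha) + n^(1/alpha))^alpha."""
    return np.power(np.power(x, 1.0 / alpha) + np.power(n, 1.0 / alpha), alpha)


if __name__ == "__main__":
    # Made-up filterbank energies, for demonstration only.
    clean = np.array([4.0, 9.0, 0.5])
    noise = np.array([1.0, 0.25, 2.0])

    # Compressing the noisy energies directly ...
    direct_log = log_compress(clean + noise)
    direct_pow = power_compress(clean + noise)

    # ... matches mapping the compressed clean and noise features
    # through the corresponding nonlinear relation.
    mapped_log = noisy_from_clean_log(log_compress(clean), log_compress(noise))
    mapped_pow = noisy_from_clean_power(power_compress(clean), power_compress(noise))

    print(np.allclose(direct_log, mapped_log), np.allclose(direct_pow, mapped_pow))
```

Under these assumptions, the form of the nonlinear relation that a vector Taylor series would linearize changes with the compressive operator, which is the dependency the letter analyzes.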
Total: 326
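246 Domestic Conference 양해민, 변경근, 강홍구 "Deep neural network-based algorithm for classifying speech quality score bands using RTCP" in Acoustical Society of Korea Spring Conference, 2016
245 Domestic Conference 김글빛, 이진규, 강홍구 "Non-negative matrix factorization-based noise removal for text-dependent speaker verification systems" in Acoustical Society of Korea Spring Conference, 2016
244 Domestic Conference 김진섭, 주영선, 강홍구 (Yonsei University), 장인선, 안충현 (Electronics and Telecommunications Research Institute) "Pitch-synchronous DNN-TTS system for improving acoustic model performance" in Acoustical Society of Korea Spring Conference, 2016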
243 International Conference Hyeongi Moon, Gyutae Park, Yeong-cheol Park, Dae Hee Youn "A Phase-Matched Exponential Harmonic Weighting for Improved Sensation of Virtual Bass" in 140th Convention of Audio Engineering Society, pp.9544, 2016
242 International Conference Il-eun Kwak, Hong-Goo Kang "Robust formant features for speaker verification in the Lombard effect" in APSIPA, pp.114-118, 2015
241 International Journal Ho Seon Shin, Tim Fingscheidt, Hong-Goo Kang "A Priori SNR Estimation Using Air- and Bone-Conduction Microphones" in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue 11, pp.2015-2025, 2015
240 International Conference Hyeonjoo Kang, JeeSok Lee, Soonho Baek, Hong-Goo Kang "Systematic Integration of Acoustic Echo Canceller and Noise Reduction Modules for Voice Communication Systems" in INTERSPEECH, 2015
239 International Conference Kyungguen Byun, Eunwoo Song, Hong-goo Kang "A constrained two-layer compression technique for ECG waves" in Engineering in Medicine and Biology Society (EMBC), 2015
238 International Conference Eunwoo Song, Hong-Goo Kang "Deep Neural Network-Based Statistical Parametric Speech Synthesis System Using Improved Time-Frequency Trajectory Excitation Mo" in INTERSPEECH, 2015
237 International Journal Taegyu Lee, Hyun Oh Oh, Jeongil Seo, Young-Cheol Park, Dae Hee Youn "Scalable Multiband Binaural Renderer for MPEG-H 3D Audio" in IEEE Journal of Selected Topics in Signal Processing, vol.9, issue 5, pp.907-920, 2015