A Dual Audio Transcoding Algorithm for Digital Multimedia Broadcasting Services

International Conference
2006-05-20 22:33
Authors : Kyoung Ho Bang, Young Cheol Park, Dae Hee Youn

Year : 2006

Publisher / Conference : 120th Convention of Audio Engineering Society

Page : 6814

In this paper, we propose a dual audio transcoding algorithm to service high quality audio streams using a broadcasting network comprising heterogeneous audio formats. As two typical cases, audio transcodings from TDTV to T-DMB and S-DMB services are considered. While the Korean DTV audio standard employs the Dolby AC-3, the Korean T-DMB and Korean S-DMB services use the MPEG-4 BSAC and the MPEG-4 HE-AAC audio coding technologies, respectively. In the proposed algorithm, the bit allocation information of AC-3 is reused in the process of BSAC and HE-AAC encodings and the nested loops are reestablished as two independent loops, which saves significant amount of computational cost. In overall, the transcoding algorithm can save about 65% of computational cost for the BSAC encoding and 31% of HE-AAC encoding. Subjective quality evaluations show that the proposed algorithm has mean diffgrades of -0.02 and -0.01 relative to the tandem method. Due to its computational simplicity and effective performance, the proposed algorithm is suitable for the mobile multimedia services.
전체 344
344 International Journal Zhenyu Piao, Hyungseob Lim, Miseul Kim, Hong-goo Kang "PDF-NET: Pitch-adaptive Dynamic Filter Network for Intra-gender Speaker Verification" in APSIPA ASC, 2023
343 International Conference WooSeok Ko, Seyun Um, Zhenyu Piao, Hong-goo Kang "Consideration of Varying Training Lengths for Short-Duration Speaker Verification" in APSIP ASC, 2023
342 International Journal Hyungchan Yoon, Changhwan Kim, Seyun Um, Hyun-Wook Yoon, Hong-Goo Kang "SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems" in IEEE Signal Processing Letters, vol.30, pp.593-597, 2023
341 International Conference Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang "BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0" in The IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), 2023
340 International Conference Seyun Um, Jihyun Kim, Jihyun Lee, Hong-Goo Kang "Facetron: A Multi-speaker Face-to-Speech Model based on Cross-Modal Latent Representations" in EUSIPCO, 2023
339 International Conference Hejung Yang, Hong-Goo Kang "Feature Normalization for Fine-tuning Self-Supervised Models in Speech Enhancement" in INTERSPEECH, 2023
338 International Conference Jihyun Kim, Hong-Goo Kang "Contrastive Learning based Deep Latent Masking for Music Source Seperation" in INTERSPEECH, 2023
337 International Conference Woo-Jin Chung, Doyeon Kim, Soo-Whan Chung, Hong-Goo Kang "MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion" in INTERSPEECH, 2023
336 International Conference Hyungchan Yoon, Seyun Um, Changhwan Kim, Hong-Goo Kang "Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech" in INTERSPEECH, 2023
335 International Conference Hyungchan Yoon, Changhwan Kim, Eunwoo Song, Hyun-Wook Yoon, Hong-Goo Kang "Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech" in INTERSPEECH, 2023