Towards Generating Diverse Audio Captions Via Adversarial Training
Journal article   Open access   Peer reviewed

Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley and Wenwu Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 32, pp. 3311-3323
21/06/2024

Abstract

Keywords: Audio captioning; cross-modal task; deep learning; GANs; Generators; Hybrid power systems; Maximum likelihood estimation; Measurement; reinforcement learning; Task analysis; Training; Semantics
Author's Accepted Manuscript CC BY V4.0 Open Access
