Logo image
Open Research University homepage
Surrey researchers Sign in
AUDIO CAPTIONING TRANSFORMER
Conference proceeding   Open access

AUDIO CAPTIONING TRANSFORMER

XINHAO MEI, XUBO LIU, QIUSHI HUANG, Mark D. Plumbley and WENWU WANG
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2021),
Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE 2021), 6th (Virtual, 15/11/2021–19/11/2021)
11/2021

Abstract

Transformer sequence-to- sequence model cross-modal task Audio captioning
pdf
camera_ready_ACT353.88 kBDownloadView
Author's Accepted Manuscript Open Access
url
http://dcase.community/workshop2021/indexView
Event WebsiteConference website

Metrics

19 File views/ downloads
123 Record Views

Details

Logo image

Usage Policy