Surrey researchers Sign in
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
Conference paper   Open access

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning

Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kılıç, Mark D. Plumbley and Wenwu Wang
Proceedings of the 24th Annual Conference of the International Speech Communication Association, INTERSPEECH (INTERSPEECH 2023), pp.4164-4168
International Speech Communication Association (ISCA)
INTERSPEECH 2023 (Dublin, Ireland, 20/08/2023 - 24/08/2023)
2023

Abstract

fused feature high-dimensional feature dual transformer decoder audio captioning PANNS
pdf
Dual Transformer Decoder based Features Fusion Network - AAM582.69 kBDownloadView
Author's Accepted Manuscript Open Access

Metrics

1 Record Views

Details

Usage Policy