Surrey researchers Sign in
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
Conference paper   Open access

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention

Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, Lilian H. Tang, …
Proceedings of the 24th Annual Conference of the International Speech Communication Association, INTERSPEECH (INTERSPEECH 2023), pp.2838-2842
International Speech Communication Association (ISCA)
INTERSPEECH 2023 (Dublin, Ireland, 20/08/2023 - 24/08/2023)
2023

Abstract

audio-visual learning multimodal learning audio captioning attention mechanism
pdf
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention - AAM524.99 kBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access

Metrics

1 Record Views

Details

Usage Policy