Logo image
Open Research University homepage
Surrey researchers Sign in
PFCA-Net: Pyramid Feature Fusion and Cross Content Attention Network for Automated Audio Captioning
Conference paper   Open access

PFCA-Net: Pyramid Feature Fusion and Cross Content Attention Network for Automated Audio Captioning

Jianyuan Sun, Wenwu Wang and Mark D. Plumbley
Interspeech 2024
International Speech Communication Association (ISCA)
Interspeech 2024 (Kos Island, Greece, 01/09/2024–05/09/2024)
01/09/2024

Abstract

high-dimensional repre- sentation cross-context attention network Pyramid feature fusion
pdf
Interspeech_Paper_Kit_final_2024827.84 kBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access
url
https://interspeech2024.org/View
Event WebsiteConference website

Metrics

100 File views/ downloads
79 Record Views

Details

Logo image

Usage Policy