Logo image
Open Research University homepage
Surrey researchers Sign in
Efficient Audio Captioning with Encoder-Level Knowledge Distillation
Conference paper   Open access

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Xuenan Xu, Haohe Liu, Mengyue Wu, Wenwu Wang and Mark D. Plumbley
Interspeech 2024, pp.1160-1164
International Speech Communication Association
INTERSPEECH 2024, 2024 (Kos Island, Greece, 01/09/2024–05/09/2024)
01/09/2024

Abstract

encoder-decoder framework knowledge distillation EfficientNet automated audio captioning
pdf
2407.14329v1497.25 kBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access
url
https://interspeech2024.org/View
Event WebsiteConference website

Metrics

79 File views/ downloads
48 Record Views

Details

Logo image

Usage Policy