Leveraging Pre-trained BERT for Audio Captioning
Conference proceeding   Open access

Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kilic and Wenwu Wang
2022 30th European Signal Processing Conference (EUSIPCO 2022), pp.1145-1149
2022 30th European Signal Processing Conference (EUSIPCO 2022) (Belgrade, Serbia, 29/08/2022–02/09/2022)
11/2022

Abstract

Keywords: audio captioning; BERT; Bit error rate; deep learning; Knowledge engineering; language models; Natural language processing; Neural networks; Pre-trained Audio Neural Networks (PANNs); Training; Europe; Signal Processing
PDF: EUSIPCO2022_Audio_Captioning (892.34 kB)
Author's Accepted Manuscript Open Access
Conference website: https://2022.eusipco.org/
