Logo image
Open Research University homepage
Surrey researchers Sign in
Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities
Journal article   Open access   Peer reviewed

Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities

Jinhua Liang, Xubo Liu, Wenwu Wang, Mark D. Plumbley, Huy Phan and Emmanouil Benetos
IEEE Transactions on Audio, Speech and Language Processing, Vol.33, pp.949-961
2025

Abstract

Audio understanding large language model audio-language learning audio recognition automated audio captioning natural language audio reasoning
pdf
Liang et al_TASLP_20251.79 MBDownloadView
Author's Accepted Manuscript CC BY V4.0 Open Access
url
https://doi.org/10.1109/TASLPRO.2025.3533375View
Published (Version of record)

Metrics

10 File views/ downloads
36 Record Views

Details

Logo image

Usage Policy