WavJourney: Compositional Audio Creation with Large Language Models
Journal article   Open access

Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Qiushi Huang, Meng Cui, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, …
IEEE Transactions on Audio, Speech and Language Processing, Vol.33, pp.2830-2844
03/06/2025

Abstract

Keywords: Artificial intelligence; audio generation; audio synthesis; computational creativity; Computational modeling; Electronic mail; Large language models; large language models (LLMs); Pipelines; Production; Program processors; Speech processing; Training; Acoustics
pdf
WavJourney_TASLP_Camera_Ready21.62 MB
Author's Accepted Manuscript, CC BY 4.0, Open Access
