Logo image
Open Research University homepage
Surrey researchers Sign in
AudioLDM 2: Learning holistic audio generation with self-supervised pretraining
Journal article   Open access   Peer reviewed

AudioLDM 2: Learning holistic audio generation with self-supervised pretraining

Haohe Liu, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Qiao Tian, Yuping Wang, Wenwu Wang, Yuxuan Wang and Mark D. Plumbley
IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol.32, pp.2871-2883
13/05/2024

Abstract

audio generation diffusion model self-supervised learning speech synthesis AIGC Acoustics Computer Science Engineering
pdf
TASLP_AudioLDM27.69 MBDownloadView
Author's Accepted Manuscript Open Access
url
https://doi.org/10.1109/TASLP.2024.3399607View
Published (Version of record)

Metrics

28 File views/ downloads
82 Record Views

Details

Logo image

Usage Policy