Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7
Preprint   Open access

Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark Plumbley and Wenwu Wang
arXiv.org
Cornell University Library, arXiv.org
15/09/2023

Abstract

Audio data; Coders; Large language models; Multimedia; Sound effects; Sound generation
url
https://arxiv.org/pdf/2305.15905.pdf
Preprint (Author's original); CC BY V4.0; Open
