Text-Driven Foley Sound Generation With Latent Diffusion Model
Preprint   Open access

arXiv.org
Cornell University Library, arXiv.org
Detection and Classification of Acoustic Scenes and Events 2023 (Tampere, Finland, 21/09/2023 - 22/09/2023)
18/09/2023

Abstract

Keywords: Ablation; Clips; Coders; Embedding; Model-based systems; Multimedia; Sound generation; Waveforms
URL: https://arxiv.org/pdf/2306.10359.pdf
Preprint (Author's original), CC BY V4.0, Open access
