Logo image
Open Research University homepage
Surrey researchers Sign in
StableTalk: Advancing Audio-to-Talking Face Generation with Stable Diffusion and Vision Transformer
Book chapter

StableTalk: Advancing Audio-to-Talking Face Generation with Stable Diffusion and Vision Transformer

Pattern Recognition, pp.271-286
Lecture Notes in Computer Science, Springer Nature Switzerland
03/12/2024

Abstract

Audio-to-Talking Face Generation Denoising Diffusion Implicit Model Latent Diffusion Re-attention Vision Transformer

Metrics

13 Record Views

Details

Logo image

Usage Policy