Stable Diffusion Image to Video, Synthesized Lauretta Young 1930s voice, Wav2Lip Demo
Channel:
Subscribers:
2,820
Published on ● Video Link: https://www.youtube.com/watch?v=ACqSC4yhPkE
Short test video. Stable Diffusion generated image turned into a video using FFMpeg. Voice is synthesized Lauretta Young; most samples from 1930s movies and radio plays. Audio quality of voice samples is very poor, but rnnoise ML model did a reasonable job cleaning them up. Synced voice to video using Wav2Lip with the wav2lip_gan model.