Scalable and Versatile 3D Generation from images
Generate a talking face video from an image and audio