Co-Speech Gesture Video Generation
Generate a video from a single image
Generate images from text prompts