Running • 330 • Qwen2.5 Omni 7B Demo • Generate text and speech responses from text, images, or audio input
Video-Guided Foley Sound Generation with Multimodal Controls Paper • 2411.17698 • Published Nov 26, 2024 • 10
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published Mar 30 • 136
Running • 856 • Kolors Character With Flux • Kolors Character to keep character developed with Flux
Running • 588 • Kolors Portrait With Flux • Kolors Portrait to keep face identity developed with Flux