CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers Paper • 2502.06527 • Published about 17 hours ago • 1
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 21 days ago • 60
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Paper • 2502.04299 • Published 5 days ago • 13
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 15 days ago • 334
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 203
bartowski/cognitivecomputations_Dolphin3.0-Mistral-24B-GGUF Text Generation • Updated 4 days ago • 2.55k • 7