Science-T2I Collection Addressing Scientific Illusions in Image Synthesis • 9 items • Updated about 21 hours ago • 2
MoCha: Towards Movie-Grade Talking Character Synthesis Paper • 2503.23307 • Published 6 days ago • 85
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 9 days ago • 77
Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 15 days ago • 33
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 18 days ago • 112
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 18 days ago • 134
Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption Paper • 2503.09279 • Published 24 days ago • 5
Autoregressive Image Generation with Randomized Parallel Decoding Paper • 2503.10568 • Published 23 days ago • 8
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Paper • 2503.05978 • Published 29 days ago • 34
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published about 1 month ago • 20
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Mar 4 • 68