Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Paper • 2503.21758 • Published 7 days ago • 18
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published 18 days ago • 42
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Paper • 2503.10618 • Published 21 days ago • 17
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published 23 days ago • 60
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published 24 days ago • 34
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Paper • 2503.07027 • Published 24 days ago • 27
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control Paper • 2503.05639 • Published 27 days ago • 22
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 29 days ago • 20
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 8 days ago • 31
Cosmos Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 8 days ago • 40
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 952
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4, 2024 • 3
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 7 days ago • 146