Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation Paper • 2506.09350 • Published 6 days ago • 46
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion Paper • 2506.08009 • Published 8 days ago • 18
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper • 2506.09113 • Published 7 days ago • 79
PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer Paper • 2505.04622 • Published May 7 • 26
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation Paper • 2505.04512 • Published May 7 • 35
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published Apr 24 • 88
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Paper • 2503.21758 • Published Mar 27 • 22
Personalize Anything for Free with Diffusion Transformer Paper • 2503.12590 • Published Mar 16 • 44
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Paper • 2503.10618 • Published Mar 13 • 17
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published Mar 11 • 66
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published Mar 10 • 36
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Paper • 2503.07027 • Published Mar 10 • 29
VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control Paper • 2503.05639 • Published Mar 7 • 24
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published Mar 5 • 22
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes both 1K and 21K pretrained models. • 13 items • Updated 6 days ago • 31
Cosmos-Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 6 days ago • 40
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 153
Article State of open video generation models in Diffusers By sayakpaul and 2 others • Jan 27 • 53
Article Introducing smolagents: simple agents that write actions in code. By m-ric and 2 others • Dec 31, 2024 • 1.06k