It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization Paper • 2504.13173 • Published 9 days ago • 17
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper • 2504.08685 • Published 15 days ago • 121
Towards Physically Plausible Video Generation via VLM Planning Paper • 2503.23368 • Published 27 days ago • 39
FreSca: Unveiling the Scaling Space in Diffusion Models Paper • 2504.02154 • Published 24 days ago • 18
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published 23 days ago • 76
MedSAM2: Segment Anything in 3D Medical Images and Videos Paper • 2504.03600 • Published 22 days ago • 8
TransMamba: Flexibly Switching between Transformer and Mamba Paper • 2503.24067 • Published 26 days ago • 20
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving Paper • 2504.02605 • Published 23 days ago • 44
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26 • 49
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy Paper • 2503.19757 • Published Mar 25 • 50
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation Paper • 2503.09641 • Published Mar 12 • 37
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 122
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 96