Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published 2 days ago • 18
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing Paper • 2508.09192 • Published 30 days ago • 30
Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts Paper • 2508.07785 • Published 26 days ago • 25