Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 72
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published 5 days ago • 48
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper • 2505.16933 • Published 3 days ago • 25
LaViDa: A Large Diffusion Language Model for Multimodal Understanding Paper • 2505.16839 • Published 4 days ago • 10