Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models Paper • 2507.07104 • Published 7 days ago • 33
Play to Generalize: Learning to Reason Through Game Play Paper • 2506.08011 • Published Jun 9 • 15
De-Diffusion Makes Text a Strong Cross-Modal Interface Paper • 2311.00618 • Published Nov 1, 2023 • 23