WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling • Paper • 2512.14614 • Published 13 days ago • 66
FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing • Paper • 2512.01755 • Published 29 days ago • 1
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer • Paper • 2511.22699 • Published Nov 27 • 215
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process • Paper • 2511.01718 • Published Nov 3 • 6