Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published about 1 month ago • 88
DreamLLM Collection [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation (https://arxiv.org/abs/2309.11499) • 6 items • Updated Mar 22, 2024 • 3
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published Mar 20 • 44
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey Paper • 2503.12605 • Published Mar 16 • 34
SoFar Collection Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation • 5 items • Updated Feb 24 • 3