LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Paper • 2403.11627 • Published Mar 18, 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition Paper • 2405.13870 • Published May 22, 2024
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Paper • 2412.15214 • Published Dec 19, 2024 • 15
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior Paper • 2407.04947 • Published Jul 6, 2024
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration Paper • 2505.20256 • Published May 26, 2025 • 17
GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding Paper • 2507.15846 • Published 29 days ago • 130
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models Paper • 2508.09138 • Published 7 days ago • 32
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning Paper • 2508.05405 • Published 12 days ago • 61
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search Paper • 2507.15245 • Published 29 days ago • 11
RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration Paper • 2505.03673 • Published May 6, 2025 • 1
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete Paper • 2502.21257 • Published Feb 28, 2025 • 2