Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control
Abstract
The Follow-Your-Shape framework uses a Trajectory Divergence Map and Scheduled KV Injection to enable precise, controllable shape editing in images while preserving non-target content.
While recent flow-based image editing models demonstrate general-purpose capabilities across diverse tasks, they often struggle to specialize in challenging scenarios, particularly those involving large-scale shape transformations. When performing such structural edits, these methods either fail to achieve the intended shape change or inadvertently alter non-target regions, resulting in degraded background quality. We propose Follow-Your-Shape, a training-free and mask-free framework that supports precise and controllable editing of object shapes while strictly preserving non-target content. Motivated by the divergence between inversion and editing trajectories, we compute a Trajectory Divergence Map (TDM) by comparing token-wise velocity differences between the inversion and denoising paths. The TDM enables precise localization of editable regions and guides a Scheduled KV Injection mechanism that ensures stable and faithful editing. To facilitate a rigorous evaluation, we introduce ReShapeBench, a new benchmark comprising 120 new images and enriched prompt pairs specifically curated for shape-aware editing. Experiments demonstrate that our method achieves superior editability and visual fidelity, particularly in tasks requiring large-scale shape replacement.
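The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the distance measure, the normalization, the threshold, or the injection schedule, so the L2 norm, min-max scaling, the `threshold` value, and the `inject_until` cutoff below are all assumptions, and every function name is hypothetical. Tokens are represented as plain lists of floats to keep the example dependency-free.

```python
import math


def trajectory_divergence_map(v_inv, v_edit):
    """Per-token divergence between two velocity fields.

    v_inv, v_edit: lists of tokens, each token a list of floats
    (velocities from the inversion and denoising paths at one timestep).
    Assumption: L2 distance per token, min-max normalized to [0, 1].
    """
    raw = [
        math.sqrt(sum((a - b) ** 2 for a, b in zip(tok_inv, tok_edit)))
        for tok_inv, tok_edit in zip(v_inv, v_edit)
    ]
    lo, hi = min(raw), max(raw)
    if hi == lo:
        # No token diverges more than any other; nothing to localize.
        return [0.0 for _ in raw]
    return [(r - lo) / (hi - lo) for r in raw]


def editable_mask(tdm, threshold=0.5):
    """Mark tokens whose divergence exceeds a (hypothetical) threshold."""
    return [d > threshold for d in tdm]


def inject_kv(source_kv, edit_kv, mask, step, inject_until):
    """Blend cached source features into the editing pass.

    Assumed schedule: while step < inject_until, tokens outside the
    editable mask reuse the source (inversion) features and tokens inside
    it keep the editing features; afterwards no injection is applied.
    """
    if step >= inject_until:
        return [list(tok) for tok in edit_kv]
    return [
        list(e) if editable else list(s)
        for s, e, editable in zip(source_kv, edit_kv, mask)
    ]


# Toy example: three tokens, only the last one diverges between paths.
v_inv = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
v_edit = [[0.0, 0.0], [1.0, 1.0], [5.0, 6.0]]
mask = editable_mask(trajectory_divergence_map(v_inv, v_edit))
blended = inject_kv([[1.0], [2.0], [3.0]], [[10.0], [20.0], [30.0]],
                    mask, step=0, inject_until=2)
```

In this toy run only the third token is flagged as editable, so the first two tokens keep their source features while the third takes the edited one. That is the intended effect of restricting changes to the localized region and leaving the background untouched.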
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- LORE: Latent Optimization for Precise Semantic Control in Rectified Flow-based Image Editing (2025)
- CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing (2025)
- Training-free Geometric Image Editing on Diffusion Models (2025)
- Transport-Guided Rectified Flow Inversion: Improved Image Editing Using Optimal Transport Theory (2025)
- Stable Score Distillation (2025)
- CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing (2025)
- STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing (2025)