ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper โข 2502.18364 โข Published Feb 25 โข 36
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper โข 2502.17258 โข Published Feb 24 โข 79
Sleeping 97 97 CountGD_Multi-Modal_Open-World_Counting ๐ Count objects in images using text or visual examples