Don't Judge Before You CLIP: A Unified Approach for Perceptual Tasks Paper • 2503.13260 • Published Mar 17 • 2
The Wisdom of a Crowd of Brains: A Universal Brain Encoder Paper • 2406.12179 • Published Jun 18, 2024 • 2
Pathways on the Image Manifold: Image Editing via Video Generation Paper • 2411.16819 • Published Nov 25, 2024 • 37
Paint by Inpaint: Learning to Add Image Objects by Removing Them First Paper • 2404.18212 • Published Apr 28, 2024 • 30
Paint by Inpaint: Learning to Add Image Objects by Removing Them First Paper • 2404.18212 • Published Apr 28, 2024 • 30
FuseCap: Leveraging Large Language Models to Fuse Visual Data into Enriched Image Captions Paper • 2305.17718 • Published May 28, 2023 • 2