Understanding Generative AI Capabilities in Everyday Image Editing Tasks
Paper
โข
2505.16181
โข
Published
โข
20
Score image-text similarity using CLIP or SigLIP models
Segment images based on text prompts
Identify and mask objects in images using text prompts
Generate correspondences between images
Explore images from ImageNet-Hard dataset