arxiv:2507.07994

Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection

Published on Jul 10

· Submitted by

Authors:

Abstract

A framework using sketches and prototypical setup achieves few-shot keypoint detection across different styles and classes.

AI-generated summary

Keypoint detection, integral to modern machine perception, faces challenges in few-shot learning, particularly when source data from the same distribution as the query is unavailable. This gap is addressed by leveraging sketches, a popular form of human expression, providing a source-free alternative. However, challenges arise in mastering cross-modal embeddings and handling user-specific sketch styles. Our proposed framework overcomes these hurdles with a prototypical setup, combined with a grid-based locator and prototypical domain adaptation. We also demonstrate success in few-shot convergence across novel keypoints and classes through extensive experiments.

View arXiv page View PDF Add to collection

Community

maitysubhajit

Paper submitter about 5 hours ago

Keypoint detection, an essential element of modern machine perception, becomes quite challenging in a few-shot scenario. While few-shot keypoint learning is common in intra-class scenarios, the strategies to adapt to novel keypoints on images of different classes are scarce. This work takes this challenge a notch higher. It establishes a cross-modal framework that employs a source-free strategy to use sketch data as support to detect keypoints on photos without needing access to the original image training data.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2507.07994 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2507.07994 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2507.07994 in a Space README.md to link it from this page.