Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection
Abstract
A framework using sketches and prototypical setup achieves few-shot keypoint detection across different styles and classes.
Keypoint detection, integral to modern machine perception, faces challenges in few-shot learning, particularly when source data from the same distribution as the query is unavailable. This gap is addressed by leveraging sketches, a popular form of human expression, providing a source-free alternative. However, challenges arise in mastering cross-modal embeddings and handling user-specific sketch styles. Our proposed framework overcomes these hurdles with a prototypical setup, combined with a grid-based locator and prototypical domain adaptation. We also demonstrate success in few-shot convergence across novel keypoints and classes through extensive experiments.
Community
Keypoint detection, an essential element of modern machine perception, becomes quite challenging in a few-shot scenario. While few-shot keypoint learning is common in intra-class scenarios, the strategies to adapt to novel keypoints on images of different classes are scarce. This work takes this challenge a notch higher. It establishes a cross-modal framework that employs a source-free strategy to use sketch data as support to detect keypoints on photos without needing access to the original image training data.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper