Learning Dense Hand Contact Estimation from Imbalanced Data
Abstract
A framework addresses dense hand contact estimation from imbalanced datasets using balanced contact sampling and a vertex-level class-balanced loss to handle class and spatial imbalance issues.
Hands are essential to human interaction, and understanding contact between hands and the world can promote a comprehensive understanding of their function. Recently, a growing number of hand interaction datasets have covered interaction with objects, the other hand, scenes, and the body. Despite the significance of the task and the increasing availability of high-quality data, how to effectively learn dense hand contact estimation remains largely underexplored. There are two major challenges in learning dense hand contact estimation. First, hand contact datasets exhibit a class imbalance, as the majority of samples are not in contact. Second, hand contact datasets exhibit a spatial imbalance, as most hand contact occurs at the fingertips, which makes it difficult to generalize to contact in other hand regions. To tackle these issues, we present a framework that learns dense HAnd COntact estimation (HACO) from imbalanced data. To resolve the class imbalance, we introduce balanced contact sampling, which builds and samples from multiple sampling groups that fairly represent diverse contact statistics for both contact and non-contact samples. Moreover, to address the spatial imbalance, we propose a vertex-level class-balanced (VCB) loss, which incorporates the spatially varying contact distribution by separately reweighting the loss contribution of each vertex based on its contact frequency across the dataset. As a result, we effectively learn dense hand contact estimation from large-scale hand contact data without suffering from class and spatial imbalance. The code will be released.
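The two ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the sketch assumes (1) sampling groups formed by binning each sample's fraction of vertices in contact, with groups drawn uniformly, and (2) an "effective number" style weight, (1 - beta) / (1 - beta ** n), computed separately for the contact and non-contact class of every vertex. The function names `balanced_sample`, `vertex_class_weights`, and `vcb_bce`, and the parameters `num_groups` and `beta`, are hypothetical.

```python
import math
import random
from collections import defaultdict


def balanced_sample(contact_labels, num_groups=4, k=8, seed=0):
    """Draw k sample indices so that every contact-ratio group is equally likely.

    contact_labels: list of samples, each a list of 0/1 labels per vertex.
    Grouping by contact ratio is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    groups = defaultdict(list)
    for idx, sample in enumerate(contact_labels):
        ratio = sum(sample) / len(sample)
        # Group 0 holds non-contact samples; contact samples fall into
        # groups 1..num_groups according to their contact ratio.
        g = 0 if ratio == 0 else 1 + min(int(ratio * num_groups), num_groups - 1)
        groups[g].append(idx)
    keys = sorted(groups)
    # Pick a group uniformly, then a sample uniformly inside that group.
    return [rng.choice(groups[rng.choice(keys)]) for _ in range(k)]


def vertex_class_weights(contact_labels, beta=0.999):
    """Return one (w_non_contact, w_contact) pair per vertex.

    Each weight is (1 - beta) / (1 - beta ** n), where n is how often that
    class occurs at that vertex (assumed form; counts are floored at 1).
    """
    num_samples = len(contact_labels)
    num_vertices = len(contact_labels[0])
    weights = []
    for v in range(num_vertices):
        n_pos = sum(sample[v] for sample in contact_labels)
        n_neg = num_samples - n_pos
        w_pos = (1 - beta) / (1 - beta ** max(n_pos, 1))
        w_neg = (1 - beta) / (1 - beta ** max(n_neg, 1))
        weights.append((w_neg, w_pos))
    return weights


def vcb_bce(probs, labels, weights, eps=1e-7):
    """Binary cross-entropy for one sample, reweighted per vertex and class."""
    total = 0.0
    for p, y, (w_neg, w_pos) in zip(probs, labels, weights):
        p = min(max(p, eps), 1 - eps)  # clamp to avoid log(0)
        total += -w_pos * math.log(p) if y == 1 else -w_neg * math.log(1 - p)
    return total / len(probs)


if __name__ == "__main__":
    # Toy data: 4 samples, 3 vertices; vertex 2 plays the role of a fingertip.
    labels = [[0, 0, 1], [0, 0, 1], [0, 0, 0], [0, 1, 1]]
    w = vertex_class_weights(labels, beta=0.9)
    print(balanced_sample(labels, k=6))
    print(vcb_bce([0.2, 0.4, 0.7], labels[3], w))
```

In this toy setup the rarer class at each vertex receives the larger weight, so a rarely touched vertex contributes more to the loss when it is in contact than a frequently touched fingertip vertex does.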
Community
We propose HACO, a framework for dense hand contact estimation that addresses class and spatial imbalance challenges in training on large-scale datasets.