Papers
arxiv:2508.00599

DPoser-X: Diffusion Model as Robust 3D Whole-body Human Pose Prior

Published on Aug 1
ยท Submitted by Moon-bow on Aug 7
Authors:
,
,
,
,
,
,
,
,
,

Abstract

DPoser-X, a diffusion-based model, addresses the complexity of 3D human poses using variational diffusion sampling and a novel truncated timestep scheduling method, outperforming existing models across various pose benchmarks.

AI-generated summary

We present DPoser-X, a diffusion-based prior model for 3D whole-body human poses. Building a versatile and robust full-body human pose prior remains challenging due to the inherent complexity of articulated human poses and the scarcity of high-quality whole-body pose datasets. To address these limitations, we introduce a Diffusion model as body Pose prior (DPoser) and extend it to DPoser-X for expressive whole-body human pose modeling. Our approach unifies various pose-centric tasks as inverse problems, solving them through variational diffusion sampling. To enhance performance on downstream applications, we introduce a novel truncated timestep scheduling method specifically designed for pose data characteristics. We also propose a masked training mechanism that effectively combines whole-body and part-specific datasets, enabling our model to capture interdependencies between body parts while avoiding overfitting to specific actions. Extensive experiments demonstrate DPoser-X's robustness and versatility across multiple benchmarks for body, hand, face, and full-body pose modeling. Our model consistently outperforms state-of-the-art alternatives, establishing a new benchmark for whole-body human pose prior modeling.

Community

Paper author Paper submitter
This comment has been hidden (marked as Resolved)
Paper author Paper submitter

๐Ÿšจ Revolutionary 3D Human Pose Prior is here!

We introduce DPoser-X โ€” the first diffusion-based robust 3D whole-body human pose prior.

๐Ÿค– Current pose priors like VPoser and NDFs struggle with diversity and realism across body parts.

So we built a diffusion-based pose prior model that:

๐Ÿงฌ Leverages unconditional diffusion models as robust pose priors
๐Ÿ” Solves pose-centric tasks through a unified optimization framework
๐Ÿ“‰ Uses truncated timestep scheduling optimized for pose data
๐ŸŽฏ Employs mixed training strategy for advanced whole-body pose modeling

Result? A versatile prior that works across ALL pose-related tasks.

๐Ÿ“Š Up to 61% improvement across 8 benchmarks, outperforming all existing alternatives.

๐Ÿ“š Paper: https://arxiv.org/abs/2508.00599
๐Ÿ’ป Code: https://github.com/careless-lu/DPoser
๐ŸŒ Project: https://dposer.github.io/
๐ŸŽฅ Demo: https://youtu.be/yzwliadFcX0

๐ŸŽ‰ Accepted as ICCV 2025 Oral!

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2508.00599 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2508.00599 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2508.00599 in a Space README.md to link it from this page.

Collections including this paper 1