Abstract
SpatialTrackerV2 is a feed-forward 3D point tracking method for monocular videos that integrates point tracking, monocular depth, and camera pose estimation into a unified, end-to-end architecture, achieving high performance and speed.
We present SpatialTrackerV2, a feed-forward 3D point tracking method for monocular videos. Going beyond modular pipelines built on off-the-shelf components for 3D tracking, our approach unifies the intrinsic connections between point tracking, monocular depth, and camera pose estimation into a high-performing, feed-forward 3D point tracker. It decomposes world-space 3D motion into scene geometry, camera ego-motion, and pixel-wise object motion, with a fully differentiable and end-to-end architecture, allowing scalable training across a wide range of datasets, including synthetic sequences, posed RGB-D videos, and unlabeled in-the-wild footage. By learning geometry and motion jointly from such heterogeneous data, SpatialTrackerV2 outperforms existing 3D tracking methods by 30%, and matches the accuracy of leading dynamic 3D reconstruction approaches while running 50× faster.
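To make the decomposition concrete, below is a minimal sketch (not the authors' code) of how a world-space 3D trajectory can be composed from the three components the abstract names: scene geometry (per-pixel depth), camera ego-motion (extrinsics), and a pixel-wise object-motion residual. All function names, tensor shapes, and the PyTorch implementation are illustrative assumptions.

```python
# Sketch: compose world-space 3D points from depth, camera pose, and object motion.
# Names and shapes are assumptions for illustration, not SpatialTrackerV2's API.
import torch

def compose_world_trajectory(uv, depth, K, cam_to_world, object_motion):
    """
    uv:            (N, 2) tracked pixel coordinates in one frame
    depth:         (N,)   predicted depth at those pixels
    K:             (3, 3) camera intrinsics
    cam_to_world:  (4, 4) camera pose (camera-to-world) for the frame
    object_motion: (N, 3) residual world-space displacement of dynamic points
    returns:       (N, 3) world-space 3D point positions
    """
    ones = torch.ones_like(depth)

    # Scene geometry: unproject pixels into camera space using depth and intrinsics.
    pix_h = torch.cat([uv, ones[:, None]], dim=1)                   # (N, 3) homogeneous pixels
    cam_pts = (torch.linalg.inv(K) @ pix_h.T).T * depth[:, None]    # (N, 3) camera-space points

    # Camera ego-motion: lift camera-space points into the world frame.
    cam_pts_h = torch.cat([cam_pts, ones[:, None]], dim=1)          # (N, 4)
    world_pts = (cam_to_world @ cam_pts_h.T).T[:, :3]               # (N, 3)

    # Pixel-wise object motion: add the residual displacement of dynamic content.
    return world_pts + object_motion
```

Evaluating this composition per frame yields a 3D trajectory for each tracked point; a static point would simply have a zero object-motion residual.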
Community
Project: https://spatialtracker.github.io/
Code & Models: https://github.com/henry123-boy/SpaTrackerV2
Online Demo: https://huggingface.co/spaces/Yuxihenry/SpatialTrackerV2
SpatialTrackerV2 is the first feed-forward model for dynamic reconstruction and 3D point tracking.