- FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
  Paper • 2312.08344 • Published • 13
- Diffusion Priors for Dynamic View Synthesis from Monocular Videos
  Paper • 2401.05583 • Published • 11
- Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
  Paper • 2401.14405 • Published • 13
- The Lessons of Developing Process Reward Models in Mathematical Reasoning
  Paper • 2501.07301 • Published • 98
James Burgess
jmhb
AI & ML interests
Vision-language models, evaluation, biology applications
Recent Activity
upvoted a paper 3 days ago
Visual Agentic Reinforcement Fine-Tuning
commented on a paper 5 days ago
Unifying Segment Anything in Microscopy with Multimodal Large Language Model
upvoted a paper 5 days ago
Unifying Segment Anything in Microscopy with Multimodal Large Language Model
Organizations
Collections: 1
Models: 0 (none public yet)