Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
Xizhou Zhu
Einsiedler
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
6 days ago
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
authored
a paper
19 days ago
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
authored
a paper
3 months ago
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding
View all activity
Organizations
None yet
Papers
10
arxiv:
2503.19757
arxiv:
2503.10291
arxiv:
2501.07783
arxiv:
2412.09604
Expand 10 papers
models
None public yet
datasets
None public yet