Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Stan Lei's picture
2 2 4

Stan Lei

leiwx52
CCP6's profile picture erinner1's profile picture 21world's profile picture
·
  • StanLei52

AI & ML interests

Vision and Language

Organizations

ARC Lab, Tencent PCG's profile picture ByteDance Seed's profile picture

upvoted 2 papers 3 months ago

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published Apr 14 • 27

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

Paper • 2504.10462 • Published Apr 14 • 15
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs