Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Jiaming Han
csuhan
AI & ML interests
Computer Vision
Recent Activity
authored
a paper
1 day ago
Vision as a Dialect: Unifying Visual Understanding and Generation via
Text-Aligned Representations
updated
a collection
2 days ago
Tar
Organizations
None yet