arxiv:2502.01341
Xiangru Jian
EdwardXJ
ยท
AI & ML interests
None yet
Recent Activity
authored
a paper
about 14 hours ago
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal
Understanding
upvoted
a
paper
about 2 months ago
BigDocs: An Open and Permissively-Licensed Dataset for Training
Multimodal Models on Document and Code Tasks
authored
a paper
about 2 months ago
BigDocs: An Open and Permissively-Licensed Dataset for Training
Multimodal Models on Document and Code Tasks