arxiv:2502.01341
Abhay Puri
abhaypuri
AI & ML interests
LLM, Vision, diffusion models
Recent Activity
authored
a paper
about 14 hours ago
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal
Understanding
authored
a paper
about 2 months ago
BigDocs: An Open and Permissively-Licensed Dataset for Training
Multimodal Models on Document and Code Tasks
Organizations
None yet