Abhay Puri's picture

Abhay Puri

abhaypuri

AI & ML interests

LLM, Vision, diffusion models

Recent Activity

authored a paper about 14 hours ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

authored a paper about 2 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

View all activity

Organizations

None yet

Papers 2

arxiv:2502.01341

arxiv:2412.04626

models

None public yet

datasets

None public yet