Abhay Puri's picture

Abhay Puri

abhaypuri

AI & ML interests

LLM, Vision, diffusion models

Recent Activity

authored a paper about 16 hours ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

authored a paper about 2 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

View all activity

Organizations

None yet

abhaypuri's activity

authored a paper about 16 hours ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published 2 days ago • 29

authored a paper about 2 months ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 13