Interesting papers to see
Manan Shah
cs-mshah
AI & ML interests
Computer Vision
Recent Activity
upvoted
a
paper
3 days ago
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale
upvoted
a
paper
3 days ago
Vidi: Large Multimodal Models for Video Understanding and Editing
upvoted
a
paper
11 days ago
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model