Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Joya Chen PRO
chenjoya
AI & ML interests
Video LLM
Recent Activity
upvoted
a
paper
3 days ago
Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought
Imagination
liked
a Space
18 days ago
tencent/SongGeneration