LiveCC-7B-Instruct
A Chain-of-LoRA Agent for Long Video Reasoning
Generate clickable coordinates on a screenshot
Generate images from text prompts
[ECCV 2024] Localizing moments in videos via text queries
Turn video uploads into real-time narration and questions