ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting Paper • 2504.15921 • Published 4 days ago • 4 • 2
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Paper • 2503.11495 • Published Mar 14 • 12 • 2
CoS: Chain-of-Shot Prompting for Long Video Understanding Paper • 2502.06428 • Published Feb 10 • 10 • 2
INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation Paper • 2501.18753 • Published Jan 30 • 3 • 2