Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published 4 days ago • 66
SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem Paper • 2505.21887 • Published 7 days ago • 15
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding Paper • 2502.14949 • Published Feb 20 • 8