EVA: Efficient Reinforcement Learning for End-to-End Video Agent Paper • 2603.22918 • Published 8 days ago • 42
Rubrics as an Attack Surface (RIPD) Collection This collection releases the official artifacts accompanying “Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges.” • 10 items • Updated about 1 month ago • 1
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency Paper • 2602.16745 • Published Feb 18 • 8