view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? zhangchenxu • Feb 25 • 14
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? zhangchenxu • Feb 25 • 14
TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning Paper • 2505.14625 • Published May 20, 2025 • 13