CompassVerifier Collection CompassVerifier: A Unified and Robust Verifier for Large Language Models • 4 items • Updated about 3 hours ago • 3
CompassVerifier Collection CompassVerifier: A Unified and Robust Verifier for Large Language Models • 4 items • Updated about 3 hours ago • 3
Rethinking Verification for LLM Code Generation: From Generation to Testing Paper • 2507.06920 • Published 28 days ago • 28
Coding Triangle: How Does Large Language Model Understand Code? Paper • 2507.06138 • Published 29 days ago • 20
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Paper • 2506.07044 • Published Jun 8 • 110
TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization Paper • 2305.01951 • Published May 3, 2023 • 1
CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries Paper • 2501.01282 • Published Jan 2
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective Paper • 2505.19815 • Published May 26 • 37
Scaling Image and Video Generation via Test-Time Evolutionary Search Paper • 2505.17618 • Published May 23 • 42
Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective Paper • 2505.19815 • Published May 26 • 37
Learn to Reason Efficiently with Adaptive Length-based Reward Shaping Paper • 2505.15612 • Published May 21 • 34
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 280
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Paper • 2503.24377 • Published Mar 31 • 18
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Paper • 2503.24377 • Published Mar 31 • 18
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Paper • 2503.14478 • Published Mar 18 • 49
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference Paper • 2502.18411 • Published Feb 25 • 75