Agentic Reasoning for Large Language Models Paper β’ 2601.12538 β’ Published 18 days ago β’ 190 β’ 6
TimeBill: Time-Budgeted Inference for Large Language Models Paper β’ 2512.21859 β’ Published Dec 26, 2025 β’ 25 β’ 4
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper β’ 2512.15603 β’ Published Dec 17, 2025 β’ 65 β’ 9
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper β’ 2512.15603 β’ Published Dec 17, 2025 β’ 65 β’ 9
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper β’ 2512.02556 β’ Published Dec 2, 2025 β’ 255 β’ 6
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights Paper β’ 2512.01816 β’ Published Dec 1, 2025 β’ 93 β’ 5
AInstein: Assessing the Feasibility of AI-Generated Approaches to Research Problems Paper β’ 2510.05432 β’ Published Oct 6, 2025 β’ 7 β’ 4
Intern-S1: A Scientific Multimodal Foundation Model Paper β’ 2508.15763 β’ Published Aug 21, 2025 β’ 268 β’ 6
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper β’ 2504.10479 β’ Published Apr 14, 2025 β’ 306 β’ 10
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper β’ 2503.07459 β’ Published Mar 10, 2025 β’ 16 β’ 3
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks Paper β’ 2503.06885 β’ Published Mar 10, 2025 β’ 4 β’ 3
MinorBench: A hand-built benchmark for content-based risks for children Paper β’ 2503.10242 β’ Published Mar 13, 2025 β’ 5 β’ 3
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper β’ 2503.07459 β’ Published Mar 10, 2025 β’ 16 β’ 3
FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation Paper β’ 2503.06680 β’ Published Mar 9, 2025 β’ 20 β’ 7
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Paper β’ 2503.10291 β’ Published Mar 13, 2025 β’ 36 β’ 3
Charting and Navigating Hugging Face's Model Atlas Paper β’ 2503.10633 β’ Published Mar 13, 2025 β’ 92 β’ 6