AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published 6 days ago • 83
Table-R1: Inference-Time Scaling for Table Reasoning Paper • 2505.23621 • Published 7 days ago • 88
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published 8 days ago • 114
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Paper • 2505.21497 • Published 9 days ago • 91
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 16 days ago • 135
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Paper • 2505.15277 • Published 15 days ago • 98
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 16 days ago • 129
AdaptThink: Reasoning Models Can Learn When to Think Paper • 2505.13417 • Published 17 days ago • 78
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published 21 days ago • 118
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published 22 days ago • 90
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper • 2505.04921 • Published 29 days ago • 175
view article Article How to Build an MCP Server with Gradio By abidlabs and 1 other • Apr 30 • 162
DeepCritic: Deliberate Critique with Large Language Models Paper • 2505.00662 • Published May 1 • 53
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25 • 43
view article Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c • Apr 25 • 267