Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published 4 days ago • 48
Test-Time Scaling with Reflective Generative Model Paper • 2507.01951 • Published 16 days ago • 85
SingLoRA: Low Rank Adaptation Using a Single Matrix Paper • 2507.05566 • Published 10 days ago • 95
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published 17 days ago • 74
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 17 days ago • 187
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper • 2506.21506 • Published 22 days ago • 48
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16 • 254
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks Paper • 2506.10954 • Published Jun 12 • 51
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published Jun 2 • 114
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published May 30 • 95
Table-R1: Inference-Time Scaling for Table Reasoning Paper • 2505.23621 • Published May 29 • 94
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published May 28 • 125
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Paper • 2505.21497 • Published May 27 • 105
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 188
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Paper • 2505.15277 • Published May 21 • 103
Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published May 20 • 131