Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany and 1 other • 12 days ago • 65
Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 19 days ago • 53
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Paper • 2505.15966 • Published 25 days ago • 51
Article Interactive Tools for machine learning, deep learning, and math By Suzana • 20 days ago • 44
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation Paper • 2505.14640 • Published 26 days ago • 14
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 26 days ago • 151
Article Microsoft and Hugging Face expand collaboration By jeffboudier and 2 others • 28 days ago • 21
Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova • about 1 month ago • 29
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • May 15 • 113
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 437
Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? By danaaubakirova and 6 others • May 11 • 63
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control By danaaubakirova and 3 others • Feb 4 • 162
Article Gotchas in Tokenizer Behavior Every Developer Should Know By qgallouedec • Apr 18 • 37