view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 1 day ago β’ 43
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper β’ 2501.18512 β’ Published 6 days ago β’ 24
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 10 items β’ Updated 6 days ago β’ 85
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published 14 days ago β’ 295
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 14 days ago β’ 30
view article Article Yay! Organizations can now publish blog Articles By huggingface β’ 16 days ago β’ 30
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper β’ 2501.03262 β’ Published Jan 4 β’ 90
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper β’ 2501.04682 β’ Published 28 days ago β’ 90
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. β’ 3 items β’ Updated Dec 20, 2024 β’ 8
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper β’ 2412.19723 β’ Published Dec 27, 2024 β’ 82