view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 4 days ago • 496
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30 • 260
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • May 23 • 144
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 295
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 62
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models Paper • 2502.01142 • Published Feb 3 • 24
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 145
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… By Xenova • Oct 22, 2024 • 74
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 405
view article Article Introducing the Open FinLLM Leaderboard By QianqianXie1994 and 12 others • Oct 4, 2024 • 78
Law of the Weakest Link: Cross Capabilities of Large Language Models Paper • 2409.19951 • Published Sep 30, 2024 • 55
view article Article Accelerating PyTorch distributed fine-tuning with Intel technologies By juliensimon • Nov 19, 2021 • 1
view article Article Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 1 By juliensimon • Jan 2, 2023 • 3