DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 6 days ago • 226
view article Article Yay! Organizations can now publish blog Articles By huggingface • 7 days ago • 30
view article Article Alpine Agent: An AI Agent to Navigate Your Winter Mountain Adventures By florentgbelidji • 11 days ago • 3
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 13 days ago • 40
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 13 days ago • 268
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 20 days ago • 249
view article Article Python Is All You Need? Introducing Dria-Agent-α By andthattoo • 17 days ago • 22
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 20 days ago • 81
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 81
view article Article 🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 25 days ago • 39
view article Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ By Sri-Vigneshwar-DJ • 23 days ago • 5
A New Approach for Explainable Multiple Organ Annotation with Few Data Paper • 1912.12932 • Published Dec 30, 2019 • 1
view article Article 🇪🇺✍️ EU AI Act: Systemic Risks in the First CoP Draft Comments ✍️🇪🇺 By yjernite • Dec 12, 2024 • 13