Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute • Running • 531
UNIVA-Bllossom/DeepSeek-llama3.3-Bllossom-70B Text Generation • Updated 26 days ago • 3.43k • 50
UNIVA-Bllossom/DeepSeek-llama3.1-Bllossom-8B Text Generation • Updated 27 days ago • 7.76k • 38
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 344