Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
fal
Together AI
Fireworks
Replicate
SambaNova
HF Inference API
Misc
Reset Misc
Inference Endpoints
open-r1
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
8-bit precision
Misc with no match
Eval Results
Merge
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
147
Full-text search
Edit filters
Sort: Trending
Active filters:
open-r1
Clear all
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-i1-GGUF
Updated
11 days ago
•
798
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
•
Updated
3 days ago
•
24
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
Updated
11 days ago
•
1.03k
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
Updated
11 days ago
•
1.76k
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
•
Updated
11 days ago
•
13
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
•
Updated
11 days ago
•
35
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
•
Updated
11 days ago
•
31
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
•
Updated
11 days ago
•
7
yh-yao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
9 days ago
•
6
qorbanpour/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
10 days ago
•
3
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Text Generation
•
Updated
4 days ago
•
65
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
•
Updated
10 days ago
•
24
schwamaths/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
10 days ago
•
6
ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated
5 days ago
schwamaths/Qwen2.5-1.5B-Instruct-Open-R1-GRPO
Text Generation
•
Updated
10 days ago
•
4
Jiawen006/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
9 days ago
•
7
mradermacher/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-GGUF
Updated
9 days ago
•
263
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
7 days ago
•
3
nlxpku/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
7 days ago
•
2
saemin21/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
9 days ago
•
2
JeffP111/Qwen2.5-3B-GRPO-Countdown
Text Generation
•
Updated
8 days ago
•
9
jl1019/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
9 days ago
•
3
zwt963/Qwen2.5-1.5B-Instruct-Open-R1-GRPO
Text Generation
•
Updated
9 days ago
•
14
susumuota/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
4 days ago
•
10
susumuota/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
3 days ago
•
2
calledice666/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
4 days ago
•
2
DominicX/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
8 days ago
•
2
Loong-Ma/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
8 days ago
•
4
bushou/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
6 days ago
•
3
DeeLearning/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
7 days ago
•
26
Previous
1
2
3
4
5
Next