Hugging Face
Models
94
Active filters:
GRPO
HuangXinBa/GRPO • Text Generation • Updated 6 days ago • 28 • 1
Ihor/Text2Graph-R1-Qwen2.5-0.5b • Text Generation • Updated Jan 30 • 908 • 20
prithivMLmods/Bellatrix-Tiny-1B-R1 • Text Generation • Updated Feb 2 • 20 • 1
mradermacher/Bellatrix-Tiny-1B-R1-GGUF • Updated Feb 3 • 87
mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF • Updated Feb 3 • 160
Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF • Text Generation • Updated Feb 3 • 4
Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF • Text Generation • Updated Feb 3 • 4
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_S-GGUF • Text Generation • Updated Feb 3 • 9
Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_M-GGUF • Text Generation • Updated Feb 3 • 1
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_S-GGUF • Text Generation • Updated Feb 3 • 7
Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_M-GGUF • Text Generation • Updated Feb 3 • 10
Triangle104/Bellatrix-Tiny-1B-R1-Q6_K-GGUF • Text Generation • Updated Feb 3 • 10
Triangle104/Bellatrix-Tiny-1B-R1-Q8_0-GGUF • Text Generation • Updated Feb 3 • 5
tecosys/Nutaan-RL1 • Reinforcement Learning • Updated Feb 7 • 243
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF • Updated Feb 9 • 87
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF • Updated Feb 9 • 81
alpha-ai/Deep-Reason-SMALL-V0-GGUF • Updated Feb 26 • 57 • 1
alpha-ai/Deep-Reason-SMALL-V0 • Text Generation • Updated Feb 26 • 14 • 2
mradermacher/Deep-Reason-SMALL-V0-GGUF • Updated Feb 9 • 43 • 2
mradermacher/Deep-Reason-SMALL-V0-i1-GGUF • Updated Feb 9 • 100 • 1
alpha-ai/qwen2.5-reason-thought-lite-GGUF • Updated Apr 28 • 39
alpha-ai/qwen2.5-reason-thought-lite • Text Generation • Updated Apr 28 • 8
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF • Updated Feb 26 • 31 • 1
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite • Text Generation • Updated Feb 26 • 11
Daemontatox/Cogito-R1 • Text Generation • Updated Feb 19 • 11 • 5
mradermacher/Cogito-R1-GGUF • Updated Feb 12 • 54
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k • Text Generation • Updated Feb 12 • 21
mradermacher/Cogito-R1-i1-GGUF • Updated Feb 13 • 384
AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV • Updated Feb 17 • 21 • 1
Nitral-AI/Captain-Eris_Violet-GRPO-v0.420 • Text Generation • Updated Apr 14 • 160 • 21