Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

112

Full-text search

Active filters: GRPO

mradermacher/Captain-Eris_Violet-GRPO-v0.420-i1-GGUF

12B • Updated Feb 18 • 927 • 5

Ihor/Text2Graph-R1-Qwen2.5-0.5b

Text Generation • 0.5B • Updated Jan 30 • 2.49k • • 20

prithivMLmods/Bellatrix-Tiny-1B-R1

Text Generation • 1B • Updated Feb 2 • 13 • • 1

mradermacher/Bellatrix-Tiny-1B-R1-GGUF

1B • Updated Feb 3 • 189

mradermacher/Bellatrix-Tiny-1B-R1-i1-GGUF

1B • Updated Feb 3 • 336

Novaciano/Bellatrix-1B-R1_Erotiquant3_IQ4_XS-GGUF

Text Generation • 1B • Updated Feb 3 • 5

Novaciano/Bellatrix-1B-R1_Erotiquant3_Q5_K_M-GGUF

Text Generation • 1B • Updated Feb 3 • 8

Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_S-GGUF

Text Generation • 1B • Updated Feb 3 • 6

Triangle104/Bellatrix-Tiny-1B-R1-Q4_K_M-GGUF

Text Generation • 1B • Updated Feb 3 • 6

Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_S-GGUF

Text Generation • 1B • Updated Feb 3 • 7

Triangle104/Bellatrix-Tiny-1B-R1-Q5_K_M-GGUF

Text Generation • 1B • Updated Feb 3 • 10

Triangle104/Bellatrix-Tiny-1B-R1-Q6_K-GGUF

Text Generation • 1B • Updated Feb 3 • 4

Triangle104/Bellatrix-Tiny-1B-R1-Q8_0-GGUF

Text Generation • 1B • Updated Feb 3 • 2

tecosys/Nutaan-RL1

Reinforcement Learning • Updated Feb 7 • 528

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF

0.5B • Updated Feb 9 • 81

mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF

0.5B • Updated Feb 9 • 274

alpha-ai/Deep-Reason-SMALL-V0-GGUF

3B • Updated Feb 26 • 35 • 1

alpha-ai/Deep-Reason-SMALL-V0

Text Generation • 3B • Updated Feb 26 • 16 • 2

mradermacher/Deep-Reason-SMALL-V0-GGUF

3B • Updated Feb 9 • 66 • 2

mradermacher/Deep-Reason-SMALL-V0-i1-GGUF

3B • Updated Feb 9 • 187 • 1

alpha-ai/qwen2.5-reason-thought-lite-GGUF

3B • Updated Apr 28 • 15

alpha-ai/qwen2.5-reason-thought-lite

Text Generation • 3B • Updated Apr 28 • 9

alpha-ai/llama-3.2-3B-Reason-Reflect-Lite-GGUF

3B • Updated Feb 26 • 27 • 1

alpha-ai/llama-3.2-3B-Reason-Reflect-Lite

Text Generation • 3B • Updated Feb 26 • 5

mradermacher/Cogito-R1-GGUF

33B • Updated Feb 12 • 206

accuracy-maker/Llama-3.2-1B-GRPO-gsm8k

Text Generation • 1B • Updated Feb 12 • 11 •

mradermacher/Cogito-R1-i1-GGUF

33B • Updated Feb 13 • 980

AaryanK/Qwen_2.5_3B_GRPO_Reasoning_XIOSERV

3B • Updated Feb 17 • 27 • 1

Nitral-AI/Captain-Eris_Violet-GRPO-v0.420

Text Generation • 12B • Updated Apr 14 • 97 • • 22

prithivMLmods/SmolLM2_135M_Grpo_Gsm8k

Text Generation • 0.1B • Updated Feb 17 • 19 • 8