Models: 339

Active filters: rlhf

mradermacher/beaver-7b-v3.0-GGUF

Reinforcement Learning • 7B • Updated Apr 1 • 168 • 1

mradermacher/beaver-7b-v1.0-GGUF

Reinforcement Learning • 7B • Updated Apr 5 • 152

loganlin777/mistral-7b-dpo-adapter

tensorblock/mlabonne_NeuralDaredevil-7B-GGUF

7B • Updated 15 days ago • 53

BryanADA/Qwen2.5-3B-cot-zh-tw

Text Generation • 3B • Updated May 23 • 64 • 1

zhuohaoyu/RewardAnything-8B-v1

Text Generation • 8B • Updated 29 days ago • 124 • 2

mradermacher/RewardAnything-8B-v1-GGUF

8B • Updated 29 days ago • 102

Pierizvi/infused-reasoning-phi2

Text Generation • Updated 29 days ago • 81

gCao/mistral-7b-dpo-arena

Updated 5 days ago • 3