Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Carbon Emissions

8-bit precision

Mixture of Experts

Misc with no match

text-embeddings-inference

Models

95

Full-text search

Active filters: reward model

Qwen/Qwen2.5-Math-RM-72B

Text Classification • Updated Oct 31 • 9.26k • 66

berkeley-nest/Starling-LM-7B-alpha

Text Generation • Updated Mar 20 • 16.8k • 556

berkeley-nest/Starling-RM-7B-alpha

Updated Jul 30 • 49 • 101

Nexusflow/Starling-LM-7B-beta

Text Generation • Updated Apr 3 • 5.58k • 342

bartowski/Starling-LM-7B-beta-GGUF

Text Generation • Updated Mar 20 • 1.13k • 25

johnsnowlabs/JSL-MedMNX-7B

Text Generation • Updated Apr 18 • 2.6k • 4

johnsnowlabs/JSL-MedMNX-7B-SFT

Text Generation • Updated Apr 18 • 2.58k • 3

johnsnowlabs/JSL-MedMNX-7B-v2.0

Text Generation • Updated Apr 22 • 2.65k • 3

jieliu/Storm-7B

Text Generation • Updated Jun 18 • 2.52k • 41

nvidia/Llama3-70B-SteerLM-RM

Updated Jun 19 • 14 • 42

nvidia/Nemotron-4-340B-Reward

Updated Jun 19 • 19 • 113

mradermacher/Storm-7B-i1-GGUF

Updated Aug 2 • 44 • 1

internlm/internlm2-1_8b-reward

Text Classification • Updated Jul 15 • 844 • 10

internlm/internlm2-7b-reward

Text Classification • Updated Jul 15 • 831 • 17

internlm/internlm2-20b-reward

Text Classification • Updated Oct 9 • 1.22k • 22

Qwen/Qwen2-Math-RM-72B

Text Classification • Updated Sep 18 • 99 • 3

nvidia/Llama-3.1-Nemotron-70B-Reward

Updated Oct 15 • 37 • 67

nvidia/Llama-3.1-Nemotron-70B-Reward-HF

Updated Oct 15 • 7.9k • 76

second-state/Llama-3.1-Nemotron-70B-Reward-HF-GGUF

Text Generation • Updated Oct 19 • 335 • 1

gaianet/Llama-3.1-Nemotron-70B-Reward-HF-GGUF

Text Generation • Updated Oct 19 • 198 • 1

yale-nlp/MDCureRM

Updated Nov 22 • 112 • 3

mradermacher/Starling-LM-7B-alpha-GGUF

Updated Nov 4 • 128 • 1

mradermacher/Starling-LM-7B-beta-GGUF

Updated 4 days ago • 148 • 1

mradermacher/Starling-LM-7B-beta-i1-GGUF

Updated 4 days ago • 759 • 1

nicholasKluge/RewardModelPT

Text Classification • Updated Jun 18 • 39

nicholasKluge/RewardModel

Text Classification • Updated Jun 18 • 38

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 6 • 23

fnlp/moss-rlhf-reward-model-7B-en

Updated Jul 13, 2023 • 9

LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 11

LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2

Text Generation • Updated Nov 27, 2023 • 11 • 1