-
-
-
-
-
-
Inference Providers
Active filters:
RLHF
aaditya/Llama3-OpenBioLLM-70B
Text Generation
•
Updated
•
26.5k
•
•
464
NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF
7B
•
Updated
•
22k
•
80
aaditya/Llama3-OpenBioLLM-8B
Text Generation
•
Updated
•
40.9k
•
•
207
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
•
769
•
25
TheBloke/Nous-Hermes-2-Mixtral-8x7B-DPO-GGUF
47B
•
Updated
•
7.17k
•
65
bartowski/OpenBioLLM-Llama3-8B-exl2
Text Generation
•
Updated
•
10
•
1
bartowski/Athene-V2-Agent-GGUF
Text Generation
•
73B
•
Updated
•
1.14k
•
8
NiuTrans/GRAM-Qwen3-1.7B-RewardModel
2B
•
Updated
•
48
•
3
NiuTrans/GRAM-Qwen3-14B-RewardModel
15B
•
Updated
•
100
•
3
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
•
1.18k
•
12
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
•
71
•
5
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
•
6.9k
•
•
223
ChaiML/gpt2_base_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
43
•
2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
21
ChaiML/gpt2_large_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
23
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
26
•
1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model
Text Classification
•
Updated
•
24
•
4
llm-blender/pair-ranker
Text Ranking
•
0.4B
•
Updated
•
19
•
3
nicholasKluge/RewardModelPT
Text Classification
•
0.1B
•
Updated
•
78
nicholasKluge/RewardModel
Text Classification
•
0.1B
•
Updated
•
157
•
•
1
fb700/chatglm-fitness-RLHF
Updated
•
268
fb700/Bofan-chatglm-Best-lora
Updated
•
19
•
11
kubernetes-bad/Ligma-L2-13b
Updated
•
9
•
3
llm-blender/PairRM
Text Generation
•
0.4B
•
Updated
•
4.79k
•
201
berkeley-nest/Starling-LM-7B-alpha
Text Generation
•
7B
•
Updated
•
6.42k
•
555
berkeley-nest/Starling-RM-7B-alpha
Updated
•
25
•
102
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
•
Updated
•
40
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
•
Updated
•
12
•
1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
•
Updated
•
20
•
2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2
Text Generation
•
Updated
•
23
•
1