-
-
-
-
-
-
Inference Providers
Active filters:
dpo
CharlesLi/OpenELM-1_1B-DPO-full-max-10-reward
Text Generation
•
1B
•
Updated
•
2
SameedHussain/gemma-2-2b-it-Flight-Multi-Turn-V3-DPO
Text Generation
•
Updated
•
2
nomadrp/tp-llama-top200-ws-ss
Updated
•
194
CharlesLi/OpenELM-1_1B-DPO-full-max-6-reward
Text Generation
•
1B
•
Updated
•
2
rdli/rdl-k8s-4bit_incremental_dpo
Text Generation
•
2B
•
Updated
•
2
CharlesLi/OpenELM-1_1B-DPO-full-max-8-reward
Text Generation
•
1B
•
Updated
•
2
CharlesLi/OpenELM-1_1B-DPO-full-max-14-reward
Text Generation
•
1B
•
Updated
•
2
mradermacher/gemma-2-2b-it-Flight-Multi-Turn-V3-DPO-GGUF
3B
•
Updated
•
45
DUAL-GPO-2/zephyr-7b-dpo-0k-15k-i1
QuantFactory/gemma-2-2b-it-Flight-Multi-Turn-V3-DPO-GGUF
3B
•
Updated
•
11
•
1
CharlesLi/OpenELM-1_1B-IPO
Text Generation
•
1B
•
Updated
•
5
CharlesLi/OpenELM-1_1B-SLiC
Text Generation
•
1B
•
Updated
•
2
DUAL-GPO/phi-2-ipo-chatml-lora-i2
DUAL-GPO/phi-2-ipo-chatml-lora-40k-60k-i2
QinLiuNLP/llama3-sudo-dpo-instruct-2epochs-jxkey-system
QuantFactory/L3.1-Celestial-Stone-2x8B-DPO-GGUF
14B
•
Updated
•
27
•
4
tanliboy/lambda-llama-3-8b-dpo-test
Text Generation
•
8B
•
Updated
•
2
DUAL-GPO/zephyr-7b-dpo-0k-15k-i1
samurai-zero/MagpieLM-8B-Chat-v0.1-Q4_K_M-GGUF
8B
•
Updated
•
1
SongTonyLi/Phi-3.5-mini-instruct-SFT-D1_chosen-then-DPO-D2a-dpo-mix-shuffled5
Text Generation
•
4B
•
Updated
•
2
NikolayKozloff/MagpieLM-4B-Chat-v0.1-Q8_0-GGUF
5B
•
Updated
•
1
•
1
NikolayKozloff/MagpieLM-8B-Chat-v0.1-Q8_0-GGUF
8B
•
Updated
•
4
•
1
Felladrin/gguf-Q5_K_M-MagpieLM-4B-Chat-v0.1
5B
•
Updated
•
8
ijohn07/MagpieLM-4B-Chat-v0.1-Q6_K-GGUF
5B
•
Updated
•
1
mradermacher/MagpieLM-8B-Chat-v0.1-GGUF
8B
•
Updated
•
131
ijohn07/MagpieLM-8B-Chat-v0.1-Q4_K_M-GGUF
8B
•
Updated
•
1
mradermacher/MagpieLM-8B-Chat-v0.1-i1-GGUF
8B
•
Updated
•
129
mradermacher/L3.1-Celestial-Stone-2x8B-DPO-GGUF
14B
•
Updated
•
126
•
2
NanQiangHF/llama3.1_8b_dpo_bwgenerator_test
8B
•
Updated
•
2
DUAL-GPO/zephyr-7b-dpo-0k-15k-i1-merged
7B
•
Updated