Active filters: dpo
sapienzanlp/Minerva-7B-instruct-v1.0
Text Generation
•
Updated
•
4.37k
•
17
sapienzanlp/Minerva-7B-instruct-v1.0-GGUF
Text Generation
•
Updated
•
156
•
5
mradermacher/SauerkrautLM-v2-14b-DPO-GGUF
Updated
•
26
•
2
robinsmits/Schaapje-2B-Chat-V1.0
Text Generation
•
Updated
•
19
•
4
mradermacher/lambda-qwen2.5-14b-dpo-test-i1-GGUF
jpacifico/Chocolatine-2-14B-Instruct-v2.0.3
Text Generation
•
Updated
•
234
•
12
mradermacher/eeve_dpo-v3-GGUF
mradermacher/eeve_dpo-v3-i1-GGUF
DebateLabKIT/Llama-3.1-Argunaut-1-8B-SPIN
Text Generation
•
Updated
•
34
•
2
mradermacher/Rude-AI-1.5B-GGUF
mradermacher/Samanta-NewGenesis-Gemma2B-DPO-GGUF
suayptalha/EmojiLlama-3.1-8B
Text Generation
•
Updated
•
72
•
5
mradermacher/lambda-llama-3-8b-ipo-test-GGUF
tahamajs/llama-3.2-3b-orpo-lora64-4bit-instruct
darkc0de/Xortron2025
Text Generation
•
Updated
•
7.7k
•
8
iRASC/Meerkat-Ko-8B-d6-w5-dpo
Text Generation
•
Updated
•
10
•
1
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO
Text Generation
•
Updated
•
5
•
1
codelion/Qwen3-0.6B-PTS-DPO
Text Generation
•
Updated
•
22
•
1
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
•
Updated
•
1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
•
8
•
12
daekeun-ml/Llama-2-ko-DPO-13B
Text Generation
•
Updated
•
1.17k
•
19
lewtun/zephyr-7b-dpo-full
Text Generation
•
Updated
•
5
alignment-handbook/zephyr-7b-dpo-full
Text Generation
•
Updated
•
87
•
3
alignment-handbook/zephyr-7b-dpo-qlora
Updated
•
49
•
9
argilla/notus-7b-v1-lora
Text Generation
•
Updated
•
10
•
7
argilla/notus-7b-v1-lora-adapter
Text Generation
•
Updated
•
3
argilla/notus-7b-v1
Text Generation
•
Updated
•
87
•
121
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
Updated
•
28
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
Updated
•
34
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
Updated
•
5