Active filters: dpo
Avinaash/beta1.6_LR_2e-05_Epoch1_DPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
ammarnasr/SmolLM-135M-GEC-DPO • Text Generation • 0.1B • Updated • 4
Avinaash/beta1.4_LR_1.5e-05_Epoch1_DPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/beta1.4_LR_3e-05_Epoch1_DPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/beta1.4_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Magpie-Align/MagpieLM-8B-Chat-v0.1 • Text Generation • 8B • Updated • 872 • 23
Avinaash/small_beta1.4_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta1.2_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta2_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta4_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta6_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta10_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta20_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta60_LR_5e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta40_LR_2e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta40_LR_1e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/small_beta40_LR_5e-06_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
Avinaash/full_LR_2e-05_Epoch1_IPO_CuratedDataset • Text Generation • 0.1B • Updated • 2
twigs/smolm-dpo • Text Generation • 0.1B • Updated • 2
nomadrp/tp-llama-top200-ws • Updated • 336
nomadrp/tp-llama-top200-ss • Updated • 192
ammarnasr/SmolLM-135M-GEC-KL-DPO • Text Generation • 0.1B • Updated • 3
coscotuff/SLFT_Trials_3 • Text Generation • 0.1B • Updated • 2
coscotuff/SLFT_Trials_4 • Text Generation • 0.1B • Updated • 2
not-lain/smol-gec-dpo • Text Generation • 0.1B • Updated • 2
coscotuff/SLFT_Trials_5 • Text Generation • 0.1B • Updated • 2
coscotuff/SLFT_Trials_6 • Text Generation • 0.1B • Updated • 2
yhshin1020/smollm-dpo • Text Generation • 0.1B • Updated • 2
coscotuff/SLFT_Trials_DPO • Text Generation • 0.1B • Updated • 3
sirsam01/codeit_dpo_model • Text Generation • 0.1B • Updated • 3