S4nto
AI & ML interests
None yet
Organizations
Models 14
S4nto/lora-dpo-finetuned-stage4-sft-_0.1_1e-6_ep-5 • Text Generation • 12B • 11
S4nto/lora-dpo-finetuned-stage4-sft-0.5-1e-6_ep5 • Text Generation • 12B • 11
S4nto/lora-dpo-finetuned-stage4-sft-0.1-1e-6_ep-1 • Text Generation • 12B • 56
S4nto/lora-dpo-finetuned-stage4-sft-0.5-1e-6_ep-1 • Text Generation • 12B • 34
S4nto/lora-dpo-finetuned-stage4-sft-ichikara • Text Generation • 12B • 30
S4nto/lora-dpo-finetuned-model-beta-0.4-rate-1e5-stage2-iter40000-sft
S4nto/lora-dpo-finetuned-model-beta-0.1-rate-1e5-stage2-iter40000-sft • Text Generation • 12B • 30
S4nto/lora-dpo-finetuned-model-beta-0.5-rate-2e6-stage2-iter40000-sft • Text Generation • 12B • 15
S4nto/lora-dpo-finetuned-model-beta-0.1-rate-1e6-stage2-iter40000-sft • Text Generation • 12B • 11
S4nto/lora-dpo-finetuned-model-beta-0.5-rate-1e6-stage2-iter40000-sft • Text Generation • 12B • 11