Hugging Face
pbevan11/Mistral-Nemo-MCAI-SFT-DPO
Text Generation
Transformers
TensorBoard
Safetensors
pbevan11/multilingual-constitutional-preference-pairs
pbevan11/ultrafeedback_binarized_multilingual
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
License: apache-2.0
Mistral-Nemo-MCAI-SFT-DPO/runs/Sep30_15-39-35_280ca1cd997c (branch: main)
1 contributor
History: 2 commits
Latest commit: pbevan11, "End of training", b490da1 (verified), 9 months ago
events.out.tfevents.1727711310.280ca1cd997c.3414.0 (12.7 kB, LFS, Safe): "Training in progress, step 83", 9 months ago
events.out.tfevents.1727712969.280ca1cd997c.3414.1 (815 Bytes, LFS): "End of training", 9 months ago