haihp02/gemma-2-2b-it-chinese-kyara-dpo-b015cb36-862a-4b53-ad58-e97e43a4ce69-dpo-tuned-only
Transformers
Safetensors
Generated from Trainer
trl
sft
dpo
arxiv:2305.18290
main
Commit History
End of training
2722195
verified
haihp02
committed 21 days ago
initial commit
611948c
verified
haihp02
committed 21 days ago