Sunshine279/gammaPO-gemma-2-9b-it
Safetensors
princeton-nlp/gemma2-ultrafeedback-armorm
gemma2
alignment-handbook
Generated from Trainer
arxiv: 2506.03690
License: mit
890c019
gammaPO-gemma-2-9b-it/train_results.json
Commit History
Upload train_results.json
12d8677
verified
Sunshine279
committed on
29 days ago