Sunshine279
/
gammaPO-llama-3-8b-instruct
Safetensors
princeton-nlp/llama3-ultrafeedback-armorm
llama
alignment-handbook
Generated from Trainer
arxiv: 2506.03690
License: mit
main
tokenizer.json
Sunshine279
Upload tokenizer.json
4fef53f
verified
29 days ago
Safe
9.08 MB
File too large to display; check the raw version instead.