Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Sunshine279
/
gammaPO-llama-3-8b-instruct
like
0
Safetensors
princeton-nlp/llama3-ultrafeedback-armorm
llama
alignment-handbook
Generated from Trainer
arxiv:
2506.03690
License:
mit
Model card
Files
Files and versions
Community
main
gammaPO-llama-3-8b-instruct
Commit History
Update README.md
bfab24b
verified
Sunshine279
commited on
28 days ago
Update README.md
a52f733
verified
Sunshine279
commited on
28 days ago
上传 model-00002-of-00004.safetensors
7a5bec9
verified
Sunshine279
commited on
29 days ago
上传 trainer_state.json
d62660a
verified
Sunshine279
commited on
29 days ago
上传 model.safetensors.index.json
0a9fdde
verified
Sunshine279
commited on
29 days ago
上传 all_results.json
0aa6244
verified
Sunshine279
commited on
29 days ago
上传 eval_results.json
b7abf73
verified
Sunshine279
commited on
29 days ago
上传 model-00003-of-00004.safetensors
ae6cd00
verified
Sunshine279
commited on
29 days ago
Update README.md
680e585
verified
Sunshine279
commited on
29 days ago
上传 training_args.bin
9cdcf13
verified
Sunshine279
commited on
29 days ago
上传 README.md
b3642b2
verified
Sunshine279
commited on
29 days ago
上传 generation_config.json
d4f0e5e
verified
Sunshine279
commited on
29 days ago
上传 train_results.json
bac1863
verified
Sunshine279
commited on
29 days ago
上传 tokenizer.json
4fef53f
verified
Sunshine279
commited on
29 days ago
上传 config.json
ce793ff
verified
Sunshine279
commited on
29 days ago
上传 special_tokens_map.json
9914792
verified
Sunshine279
commited on
29 days ago
上传 tokenizer_config.json
e02fb52
verified
Sunshine279
commited on
29 days ago
上传 model-00001-of-00004.safetensors
66db188
verified
Sunshine279
commited on
29 days ago
上传 model-00004-of-00004.safetensors
cabbe9f
verified
Sunshine279
commited on
29 days ago
initial commit
d7711aa
verified
Sunshine279
commited on
29 days ago