hanyinwang/layer-project-reward-model
PEFT
Safetensors
hanyinwang/layer-project-reward-training
English
trl
reward-trainer
Generated from Trainer
License: apache-2.0
main
Commit History
Update README.md
a441430
verified
hanyinwang
committed on
May 3, 2024
Update README.md
d5a3dcf
verified
hanyinwang
committed on
May 3, 2024
Update README.md
c79a1aa
verified
hanyinwang
committed on
May 3, 2024
Upload data_reward_model_training.csv
7acfd95
verified
hanyinwang
committed on
May 3, 2024
End of training
1bce8c1
verified
hanyinwang
committed on
May 2, 2024
initial commit
06f8cba
verified
hanyinwang
committed on
May 2, 2024