berkeley-nest/Starling-RM-7B-alpha
Transformers
PyTorch
berkeley-nest/Nectar
English
llama
reward model
RLHF
RLAIF
text-generation-inference
Inference Endpoints
arxiv:2203.02155
arxiv:2301.11270
License: apache-2.0
refs/pr/3
Henrique - Sem título 15 de nov. de 2023 2232 2023-11-15 22_34.m4a
HenriqueMendes
Luciano
fb18d4f
about 1 year ago
170 kB
This file contains binary data. It cannot be displayed, but you can still download it.