Transformers
PyTorch
English
llama
reward model
RLHF
RLAIF
text-generation-inference