Max (reciprocate)
AI & ML interests: Reward models
reciprocate/mistral-7b-gsm8k-code-rm • Text Classification • 4 • 3
reciprocate/mistral-7b-rm • Text Classification • 9 • 2
reciprocate/rm_beluga-7b_hh-full • Text Classification • 5
reciprocate/rm-llama2-7b-gsm8k • Text Generation • 11
reciprocate/llama2-7b-gsm8k • Text Generation • 8 • 1
reciprocate/shepherd-13b • Text Generation • 11 • 1
reciprocate/tiny-llama • Text Generation • 33 • 2
reciprocate/vicuna-13b_rm_oasst-hh • Text Classification • 29
reciprocate/openllama-13b-rlhf-v0 • Text Generation • 9
reciprocate/openllama-13b_rm_oasst-hh • Text Classification • 13