Hugging Face
wangclnlp/GRAM-RR-LLaMA-3.2-3B-RewardModel
Tags: Text Generation, Safetensors, English, llama, Reward, RewardModel, RewardReasoning, Reasoning, RLHF, Best-of-N, conversational
arxiv: 2509.02492
License: apache-2.0
1.55 kB
1 contributor
History: 1 commit
Latest commit: initial commit by wangclnlp (03cbf0d, verified), 3 months ago
.gitattributes    1.52 kB     initial commit    3 months ago
README.md         31 Bytes    initial commit    3 months ago