Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Ray2333 's Collections
GRM

GRM

updated Nov 25, 2024

Generalizable Reward Models

Upvote
4

  • Ray2333/GRM-llama3-8B-sftreg

    Text Classification • Updated Feb 5 • 116 • 5

  • Ray2333/GRM-llama3-8B-distill

    Text Classification • Updated Feb 5 • 247 • 6

  • Ray2333/GRM-Gemma-2B-sftreg

    Text Classification • Updated Feb 5 • 47 • 3

  • Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

    Paper • 2406.10216 • Published Jun 14, 2024 • 2

  • Ray2333/GRM-Gemma-2B-rewardmodel-ft

    Updated Feb 5 • 51 • 1

  • Ray2333/GRM-Llama3-8B-rewardmodel-ft

    Updated Feb 5 • 463 • 1

  • Ray2333/GRM-llama3.2-3B-sftreg

    Text Classification • Updated Feb 5 • 18 • 1

  • Ray2333/GRM-Gemma2-2B-sftreg

    Text Classification • Updated Feb 5 • 11 • 1

  • Ray2333/GRM-Llama3.2-3B-rewardmodel-ft

    Text Classification • Updated Apr 30 • 1.73k • 13

  • Ray2333/GRM-gemma2-2B-rewardmodel-ft

    Text Classification • Updated Feb 5 • 1.19k • 7

  • Ray2333/GRM_Llama3.1_8B_rewardmodel-ft

    Text Classification • Updated Feb 5 • 205 • 5
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs