Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GenRM: Generative Reward Models

community
https://www.synthlabs.ai/research/generative-reward-models
synth_labs
SynthLabsAI
Activity Feed

AI & ML interests

None defined yet.

nathan lile's profile picture SynthLabs's profile picture post-training's profile picture

models 0

None public yet

datasets 24

GenRM/gutenberg-dpo-v0.1-jondurbin

Viewer • Updated 26 days ago • 918 • 56

GenRM/HelpSteer2-DPO-Atsunori

Viewer • Updated 26 days ago • 7.59k • 50

GenRM/MetaMath_DPO_FewShot-abacusai

Viewer • Updated 26 days ago • 395k • 72

GenRM/reddit-dpo-nbeerbower

Viewer • Updated 26 days ago • 76.9k • 59

GenRM/function-calling-v0.2-with-r1-cot-AymanTarig

Viewer • Updated 26 days ago • 58k • 62

GenRM/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B-Magpie-Align

Viewer • Updated 26 days ago • 250k • 60

GenRM/dolphin-r1-cognitivecomputations

Updated 26 days ago • 50

GenRM/SCP-116K-EricLu

Updated 26 days ago • 52

GenRM/Bespoke-Stratos-17k-bespokelabs

Viewer • Updated 26 days ago • 16.7k • 53

GenRM/OpenThoughts-114k-open-thoughts

Viewer • Updated 26 days ago • 114k • 83
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs