Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GenRM: Generative Reward Models

community
https://www.synthlabs.ai/research/generative-reward-models
synth_labs
SynthLabsAI
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

nlile  authored a paper 5 days ago
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
nlile  authored a paper 4 months ago
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
nlile  authored a paper 4 months ago
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
View all activity

nathan lile's profile picture SynthLabs's profile picture post-training's profile picture

models 0

None public yet

datasets 24

GenRM/gutenberg-dpo-v0.1-jondurbin

Viewer • Updated May 11 • 918 • 16

GenRM/HelpSteer2-DPO-Atsunori

Viewer • Updated May 11 • 7.59k • 16

GenRM/MetaMath_DPO_FewShot-abacusai

Viewer • Updated May 11 • 395k • 38

GenRM/reddit-dpo-nbeerbower

Viewer • Updated May 11 • 76.9k • 39

GenRM/function-calling-v0.2-with-r1-cot-AymanTarig

Viewer • Updated May 11 • 58k • 37

GenRM/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B-Magpie-Align

Viewer • Updated May 11 • 250k • 41

GenRM/dolphin-r1-cognitivecomputations

Updated May 11 • 13

GenRM/SCP-116K-EricLu

Updated May 11 • 7

GenRM/Bespoke-Stratos-17k-bespokelabs

Viewer • Updated May 11 • 16.7k • 13

GenRM/OpenThoughts-114k-open-thoughts

Viewer • Updated May 11 • 114k • 40
View 24 datasets
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs