Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GenRM: Generative Reward Models

community
https://www.synthlabs.ai/research/generative-reward-models
synth_labs
SynthLabsAI
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

nlile  authored a paper 6 days ago
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
nlile  authored a paper 4 months ago
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
nlile  authored a paper 4 months ago
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
View all activity

nathan lile's profile picture SynthLabs's profile picture post-training's profile picture

GenRM 's models

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs