
Med-PRM

updated 7 days ago

This collection hosts the Med-PRM series introduced in the paper "Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards".

  • dmis-lab/llama-3.1-medprm-reward-v1.0

    Text Generation • 8B • Updated 30 days ago • 397 • 13

  • dmis-lab/llama-3.1-medprm-reward-raw-training-set

    Viewer • Updated 7 days ago • 11.7k • 8

  • dmis-lab/llama-3.1-medprm-reward-training-set

    Viewer • Updated 30 days ago • 11.7k • 177 • 5

  • dmis-lab/llama-3.1-medprm-reward-test-set

    Updated 30 days ago • 89 • 2

  • Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

    Paper • 2506.11474 • Published Jun 13 • 17