Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Qwen
/
Qwen2.5-Math-PRM-7B

Text Classification
Transformers
Safetensors
English
Chinese
qwen2
feature-extraction
reward model
custom_code
text-generation-inference
Model card Files Files and versions
xet
Community
11
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

this case maybe not suitable?

#11 opened 2 months ago by
HarryJan

Error loading model

2
#10 opened 3 months ago by
lmiller-phdata

Questions about data scale

#9 opened 4 months ago by
masterLan

Ask questions about training data construction

1
#8 opened 4 months ago by
zzzzz2023

A question about the effectiveness of Qwen2.5-Math-PRM-7B in reinforcement learning

#7 opened 4 months ago by
zsyyy

If the response length exceeds 4096, is a sliding window used, or is it simply truncated?

#6 opened 4 months ago by
ShelterW

question about the step separato "\n\n"

1
#3 opened 4 months ago by
pixas

Could you clarify whether the PRM800K deduplication was performed using the original 5000-test set from MATH or the MATH500 dataset?

3
#2 opened 4 months ago by
masterLan

vllm support

3
#1 opened 4 months ago by
baohao
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs