Questions about data scale

#1
by masterLan - opened

How much data was used to train the final version of Qwen-2.5-MATH-PRM?

The final version of Qwen2.5-Math-PRM (the Qwen2.5-Math-PRM-7B model) was trained on about 3 million MC-estimation samples. A consensus filtering step retained only about 40% of them, leaving a final training set of roughly 1.2 million high-consensus samples.
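To make the filtering step concrete, here is a minimal Python sketch. It assumes "consensus filtering" means keeping a sample only when two independent label sources (e.g. MC-estimated step labels and a second judge's labels) fully agree; the field names `mc_labels` and `judge_labels` and the toy data are hypothetical, not the authors' actual pipeline.

```python
def consensus_filter(samples):
    """Keep samples whose two sets of per-step labels fully agree.

    Each sample is a dict with hypothetical keys 'mc_labels' and
    'judge_labels', each a list of 0/1 step-correctness labels.
    """
    kept = []
    for sample in samples:
        if sample["mc_labels"] == sample["judge_labels"]:
            kept.append(sample)
    return kept


if __name__ == "__main__":
    # Toy data: 5 samples, 2 of which have agreeing labels.
    samples = [
        {"mc_labels": [1, 1, 0], "judge_labels": [1, 1, 0]},  # agree -> kept
        {"mc_labels": [1, 0, 0], "judge_labels": [1, 1, 0]},  # disagree
        {"mc_labels": [1],       "judge_labels": [1]},        # agree -> kept
        {"mc_labels": [0, 1],    "judge_labels": [1, 1]},     # disagree
        {"mc_labels": [1, 1],    "judge_labels": [0, 1]},     # disagree
    ]
    kept = consensus_filter(samples)
    print(f"retained {len(kept)}/{len(samples)} samples")

    # At the scale described above, a ~40% retention rate on 3M samples
    # leaves roughly 3,000,000 * 0.4 = 1,200,000 training samples.
    print(f"estimated final set: {int(3_000_000 * 0.4):,}")
```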
