One-Shot RLVR
Collection
Collections of models and papers for works: "Reinforcement Learning for Reasoning in Large Language Models with One Training Example"
•
8 items
•
Updated
•
1
This repository contains the model presented in Reinforcement Learning for Reasoning in Large Language Models with One Training Example.