Salesforce
/

E1-Math-1.5B

Text Generation

text-generation-inference

Model card Files Files and versions Community

yuhuixu commited on 12 days ago

Commit

d742e7d

·

verified ·

1 Parent(s): 503abe7

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ license: cc-by-nc-4.0
 ## Introduction
 E1-Math-1.5B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B. It is trained for Elastic Reasoning by budget-constrained rollout strategy, integrated into GRPO, which teaches the model to reason adaptively when the thinking process is cut short and generalizes effectively to unseen budget constraints without additional training.
-## Performance
 | Model | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) |
 |---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|

 ## Introduction
 E1-Math-1.5B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B. It is trained for Elastic Reasoning by budget-constrained rollout strategy, integrated into GRPO, which teaches the model to reason adaptively when the thinking process is cut short and generalizes effectively to unseen budget constraints without additional training.
+## Performance (Avg@16)
 | Model | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) |
 |---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|