Update README.md
README.md

---

## Introduction

E1-Math-1.5B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B. It is trained for [**Elastic Reasoning**](https://arxiv.org/pdf/2505.05315) with a budget-constrained rollout strategy integrated into GRPO, which teaches the model to reason adaptively when its thinking process is cut short and to generalize effectively to unseen budget constraints without additional training.
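
Because the model is trained to produce a complete solution even when its thinking phase is truncated, inference can enforce the thinking and solution budgets explicitly. The snippet below is a minimal sketch of that two-stage decoding with `transformers`; the repository id, the `</think>` delimiter, and the example budgets (1024 thinking / 512 solution tokens) are assumptions for illustration, not fixed by this card.

```python
# Two-stage, budget-constrained decoding sketch. Assumptions: the checkpoint name,
# the <think>...</think> reasoning delimiter, and the token budgets below.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Salesforce/E1-Math-1.5B"  # assumed repo id; replace with the actual checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")

question = "What is the sum of the first 100 positive integers?"
prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": question}],
    tokenize=False,
    add_generation_prompt=True,
)

thinking_budget, solution_budget = 1024, 512  # example split of the total token budget

# Stage 1: generate the thinking phase, hard-capped at the thinking budget.
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
thought_ids = model.generate(**inputs, max_new_tokens=thinking_budget, do_sample=False)
text = tokenizer.decode(thought_ids[0], skip_special_tokens=False)

# Stage 2: if the budget ran out before the reasoning closed, force-close it so the
# model switches to writing the final solution within its own budget.
if "</think>" not in text:
    text += "\n</think>\n"
inputs = tokenizer(text, return_tensors="pt", add_special_tokens=False).to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=solution_budget, do_sample=False)
print(tokenizer.decode(output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

The paper's budget-constrained rollout similarly splits a total budget into separate thinking and solution budgets, which is why the solution stage above gets its own cap.
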
## Performance (Avg@16)