Salesforce
/

E1-Code-14B

Text Generation

text-generation-inference

Model card Files Files and versions Community

yuhuixu commited on 6 days ago

Commit

9eebf1f

·

verified ·

1 Parent(s): 35ed824

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -8,7 +8,7 @@ license: cc-by-nc-4.0
 ---
 ## Introduction
-E1-Code-14B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-14B. It is trained for Elastic Reasoning by budget-constrained rollout strategy, integrated into GRPO, which teaches the model to reason adaptively when the thinking process is cut short and generalizes effectively to unseen budget constraints without additional training.
 ## Usage
 For detailed usage, please refer to [repo](https://github.com/SalesforceAIResearch/Elastic-Reasoning).

 ---
 ## Introduction
+E1-Code-14B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-14B. It is trained for [**Elastic Reasoning**](https://arxiv.org/pdf/2505.05315) by budget-constrained rollout strategy, integrated into GRPO, which teaches the model to reason adaptively when the thinking process is cut short and generalizes effectively to unseen budget constraints without additional training.
 ## Usage
 For detailed usage, please refer to [repo](https://github.com/SalesforceAIResearch/Elastic-Reasoning).