Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
yuhuixu commited on
Commit
35ed824
·
verified ·
1 Parent(s): e4f3e8e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -8,7 +8,7 @@ license: cc-by-nc-4.0
8
  ---
9
 
10
  ## Introduction
11
- E1-Math-14B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-14B. It is trained for Elastic Reasoning by budget-constrained rollout strategy, integrated into GRPO, which teaches the model to reason adaptively when the thinking process is cut short and generalizes effectively to unseen budget constraints without additional training.
12
 
13
  ## Usage
14
  For detailed usage, please refer to [repo](https://github.com/SalesforceAIResearch/Elastic-Reasoning).
 
8
  ---
9
 
10
  ## Introduction
11
+ E1-Code-14B is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-14B. It is trained for Elastic Reasoning by budget-constrained rollout strategy, integrated into GRPO, which teaches the model to reason adaptively when the thinking process is cut short and generalizes effectively to unseen budget constraints without additional training.
12
 
13
  ## Usage
14
  For detailed usage, please refer to [repo](https://github.com/SalesforceAIResearch/Elastic-Reasoning).