Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@
|
|
11 |
|
12 |
|
13 |
## Introduction
|
14 |
-
We propose **Elastic Reasoning
|
15 |
that explicitly separates reasoning into two phases—`thinking and solution`—with
|
16 |
independently allocated budgets. At test time, Elastic Reasoning prioritize that
|
17 |
completeness of solution segments, significantly improving reliability under tight
|
@@ -41,6 +41,17 @@ training.
|
|
41 |
|
42 |
[**E1-Code-14B**](https://huggingface.co/Salesforce/E1-Code-14B)
|
43 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
44 |
## Github
|
45 |
|
46 |
[Training and evaluation code](https://github.com/SalesforceAIResearch/Elastic-Reasoning)
|
|
|
11 |
|
12 |
|
13 |
## Introduction
|
14 |
+
We propose [**Elastic Reasoning**](https://arxiv.org/pdf/2505.05315), a novel framework for scalable chain of thoughts
|
15 |
that explicitly separates reasoning into two phases—`thinking and solution`—with
|
16 |
independently allocated budgets. At test time, Elastic Reasoning prioritize that
|
17 |
completeness of solution segments, significantly improving reliability under tight
|
|
|
41 |
|
42 |
[**E1-Code-14B**](https://huggingface.co/Salesforce/E1-Code-14B)
|
43 |
|
44 |
+
[**E1-Math-7B**](https://huggingface.co/Salesforce/E1-Math-7B)
|
45 |
+
|
46 |
+
## Performance (Avg@16)
|
47 |
+
|
48 |
+
| Model | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) | Tokens | Acc (%) |
|
49 |
+
|---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|--------------|---------------|
|
50 |
+
| DeepSscaleR-1.5B | 10050 | 41.0| 1488 | 5.2 | 1904 | 9.6 | 2809 | 15.8 | 3700 | 22.7 |
|
51 |
+
| E1-Math-1.5B | 6825 | 35.0 | 1340 | 13.5 | 1799 | 17.5 | 2650 | 24.8 | 3377 | 27.9 |
|
52 |
+
| Skywork-OR1-Math-7B | 13803 | 68.3 | 1534 | 1.0 | 2047 | 2.1 | 3051 | 7.7 | 4023 | 14.0 |
|
53 |
+
| E1-Math-7B | 11768 | 69.6 | 1381 | 16.9 | 1841 | 21.3 | 2799 | 26.0 | 3742 | 32.9 |
|
54 |
+
|
55 |
## Github
|
56 |
|
57 |
[Training and evaluation code](https://github.com/SalesforceAIResearch/Elastic-Reasoning)
|