SakanaAI/EvoLLM-JP-v1-7B


mkshing committed on Mar 13, 2024

Commit 5587f67 · verified · 1 Parent(s): e0207ff

Update README.md

Files changed (1)

README.md +11 -3

README.md CHANGED

@@ -71,9 +71,17 @@ print(generated_text)
 ## Evaluation
-We present the results that compares the performance of the our evolved LLMs compared to the source LLMs. To reproduce the results, please use [our Github repository](https://github.com/SakanaAI/evolving-merged-models).
-![eval-results](./evollm-math-results.png)
 ## Citation

 ## Evaluation
+We present results on the [MGSM-JA](juletxara/mgsm) test set, comparing the performance of our evolved LLMs with that of the source LLMs. To reproduce the results, please use [our GitHub repository](https://github.com/SakanaAI/evolving-merged-models).
+| Id. | Model | Type | Params | MGSM-JA (acc &uarr; ) |
+| :--: | :-- | :-- | --: | --: |
+| 1 | [Shisa Gamma 7B v1](https://huggingface.co/augmxnt/shisa-gamma-7b-v1) | JA general | 7B | 9.6 |
+| 2 | [WizardMath 7B V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1) | EN math | 7B | 18.4 |
+| 3 | [Abel 7B 002](https://huggingface.co/GAIR/Abel-7B-002) | EN math | 7B | 30.0 |
+| 4 | [Arithmo2 Mistral 7B](https://huggingface.co/upaya07/Arithmo2-Mistral-7B) | EN math | 7B | 24.0 |
+| 5 | [(Ours) EvoLLM-v1-JP-7B](https://huggingface.co/SakanaAI/EvoLLM-v1-JP-7B) | 1+2+3 | 7B | **52.0** |
+| 6 | [(Ours) EvoLLM-v1-JP-7B-A](https://huggingface.co/SakanaAI/EvoLLM-v1-JP-7B-A) | 1+3+4 | 7B | **52.4** |
+| 7 | [(Ours) EvoLLM-v1-JP-10B](https://huggingface.co/SakanaAI/EvoLLM-v1-JP-10B) | 1 + 5 | 10B | **55.6** |
 ## Citation
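
Note on the MGSM-JA column added in this commit: the values are accuracies in percent. The sketch below shows one way such a score could be computed from model outputs. It is not the scoring code from the linked repository. The helper names `extract_final_number` and `accuracy_percent` are hypothetical, and the rule "take the last number in the output and compare it with the gold answer" is an assumption made for illustration.

```python
import re


def extract_final_number(text):
    """Return the last number found in `text` as a float, or None.

    Assumption: the model's final answer is the last number it prints.
    Thousands separators such as "1,250" are removed before matching.
    """
    matches = re.findall(r"-?\d+(?:\.\d+)?", text.replace(",", ""))
    return float(matches[-1]) if matches else None


def accuracy_percent(outputs, gold_answers):
    """Percentage of outputs whose extracted number equals the gold answer."""
    if len(outputs) != len(gold_answers):
        raise ValueError("outputs and gold_answers must have the same length")
    if not outputs:
        raise ValueError("cannot score an empty set")
    correct = 0
    for output, gold in zip(outputs, gold_answers):
        predicted = extract_final_number(output)
        # An output with no number at all is counted as wrong.
        if predicted is not None and predicted == float(gold):
            correct += 1
    return 100.0 * correct / len(outputs)


if __name__ == "__main__":
    demo_outputs = ["答えは 18 です", "合計は 3", "不明"]
    demo_gold = [18, 4, 7]
    print(round(accuracy_percent(demo_outputs, demo_gold), 1))
```

MGSM provides 250 test problems per language, so under this kind of scoring a reported 52.0 would correspond to 130 correct answers out of 250.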