AstroMLab/astrollama-2-70b-chat_aic

tingyuansen committed on Sep 30, 2024

Commit 665c5fe (verified) · 1 parent: 050a4a8

Update README.md

Files changed (1)

README.md +11 -0

@@ -82,6 +82,17 @@ print(f"Assistant: {response}")
 While the AstroLLaMA-2-70B-Base_AIC model demonstrated significant improvements over its baseline LLaMA-2-70B model, the chat version (AstroLLaMA-2-70B-Chat_AIC) experiences performance degradation due to limitations in the SFT process. Here's a performance comparison:
+| Model | Score (%) |
+|-------|-----------|
+| **<span style="color:green">AstroLLaMA-2-70B-Base (AstroMLab)</span>** | **<span style="color:green">76.0</span>** |
+| LLaMA-3.1-8B | 73.7 |
+| LLaMA-2-70B | 70.7 |
+| Gemma-2-9B | 71.5 |
+| Qwen-2.5-7B | 70.4 |
+| Yi-1.5-9B | 68.4 |
+| InternLM-2.5-7B | 64.5 |
+| Mistral-7B-v0.3 | 63.9 |
+| ChatGLM3-6B | 50.4 |
 Key limitations: