Performance LLMs - Base Models
Collection
22 items
•
Updated
•
7
Open Source License The code is licensed under Apache-2.0, while model weights are fully open for academic research and also allow free commercial usage. To apply for a commercial license, please fill in the application form (English)/申请表(中文). For other questions or collaborations, please contact [email protected].
Detailed results can be found here
Metric | Value |
---|---|
Avg. | 65.09 |
AI2 Reasoning Challenge (25-Shot) | 61.35 |
HellaSwag (10-Shot) | 82.08 |
MMLU (5-Shot) | 61.59 |
TruthfulQA (0-shot) | 57.71 |
Winogrande (5-shot) | 76.72 |
GSM8k (5-shot) | 51.10 |