TinyPixel
/

testmodel-3

Text Generation

text-generation-inference

Model card Files Files and versions

testmodel-3 / README.md

leaderboard-pr-bot's picture

leaderboard-pr-bot

Adding Evaluation Results

b7c211b almost 2 years ago

|

653 Bytes

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	43.48
ARC (25-shot)	53.24
HellaSwag (10-shot)	78.72
MMLU (5-shot)	46.57
TruthfulQA (0-shot)	38.75
Winogrande (5-shot)	73.88
GSM8K (5-shot)	7.58
DROP (3-shot)	5.64