This is OpenLLaMA 3B V2 finetuned on EverythingLM Data V2 (ShareGPT format) for 2 epochs.
Prompt template:

```
### HUMAN:
{prompt}

### RESPONSE:
```

Leave a newline after `### RESPONSE:` for the model to answer.
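A minimal usage sketch with the standard 🤗 Transformers API, applying the template above; the question and generation settings are illustrative, not prescribed by this card:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and tokenizer from the Hub.
tokenizer = AutoTokenizer.from_pretrained("acrastt/Marx-3B-V2")
model = AutoModelForCausalLM.from_pretrained("acrastt/Marx-3B-V2")

# Build the prompt from the template, ending with a newline
# so the model answers on the next line.
prompt = "### HUMAN:\nWhat is the speed of light?\n\n### RESPONSE:\n"

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)  # illustrative settings
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```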
A q4_1 GGML quant is available here.
A q4_1 GGUF quant is available here.
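For the GGUF quant, a sketch with llama-cpp-python is shown below; the model filename is hypothetical and should match whatever file you download:

```python
from llama_cpp import Llama

# Path to the downloaded q4_1 GGUF file; the filename is hypothetical.
llm = Llama(model_path="marx-3b-v2.Q4_1.gguf")

output = llm(
    "### HUMAN:\nSummarize the theory of relativity in one sentence.\n\n### RESPONSE:\n",
    max_tokens=128,          # illustrative limit
    stop=["### HUMAN:"],     # stop before the template repeats
)
print(output["choices"][0]["text"])
```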
Open LLM Leaderboard Evaluation Results
Detailed results can be found here.
| Metric | Value |
|---|---|
| Avg. | 42.08 |
| AI2 Reasoning Challenge (25-shot) | 44.03 |
| HellaSwag (10-shot) | 72.92 |
| MMLU (5-shot) | 27.84 |
| TruthfulQA (0-shot) | 39.92 |
| Winogrande (5-shot) | 66.54 |
| GSM8K (5-shot) | 1.21 |