mistral-nemo-gutenberg-12B-v4
TheDrummer/Rocinante-12B-v1 fine-tuned on the jondurbin/gutenberg-dpo-v0.1 dataset.
Method
Fine-tuned on an A100 GPU in Google Colab for 3 epochs.
Reference: Fine-tune Llama 3 with ORPO
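For readers unfamiliar with ORPO, the objective named in the reference above, here is a minimal sketch of its per-example loss in plain Python. It assumes the standard ORPO formulation (supervised loss on the chosen response plus a weighted odds-ratio penalty). The function names, the scalar inputs, and the weight `lam=0.1` are illustrative assumptions; this card does not state the hyperparameters or training code actually used.

```python
import math


def log_odds(logp: float) -> float:
    """Return log(p / (1 - p)) given log p, for 0 < p < 1."""
    return logp - math.log1p(-math.exp(logp))


def orpo_loss(logp_chosen: float, logp_rejected: float,
              nll_chosen: float, lam: float = 0.1) -> float:
    """Sketch of the ORPO loss for one preference pair.

    logp_chosen / logp_rejected: length-normalized log-probabilities the
        model assigns to the chosen and rejected responses (hypothetical inputs).
    nll_chosen: supervised negative log-likelihood of the chosen response.
    lam: weight of the odds-ratio term (illustrative value, not from this card).
    """
    # Log odds ratio between the chosen and rejected responses.
    log_or = log_odds(logp_chosen) - log_odds(logp_rejected)
    # -log(sigmoid(log_or)) written as softplus(-log_or).
    odds_ratio_term = math.log1p(math.exp(-log_or))
    return nll_chosen + lam * odds_ratio_term


# The penalty shrinks as the chosen response becomes more likely than the rejected one.
tied = orpo_loss(math.log(0.5), math.log(0.5), nll_chosen=1.0)
preferred = orpo_loss(math.log(0.5), math.log(0.25), nll_chosen=1.0)
print(tied > preferred)
```

In practice a training library would compute these quantities over token batches; the scalar version only shows how the two terms combine.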
Open LLM Leaderboard Evaluation Results
Detailed results can be found here
| Metric | Value |
|---|---:|
| Avg. | 19.56 |
| IFEval (0-Shot) | 23.79 |
| BBH (3-Shot) | 31.97 |
| MATH Lvl 5 (4-Shot) | 10.95 |
| GPQA (0-shot) | 8.84 |
| MuSR (0-shot) | 13.20 |
| MMLU-PRO (5-shot) | 28.62 |
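
The reported average appears to be the unweighted mean of the six benchmark scores. A short check, assuming simple averaging (the dictionary below just restates the values from the table):

```python
# Benchmark scores copied from the results table.
scores = {
    "IFEval (0-Shot)": 23.79,
    "BBH (3-Shot)": 31.97,
    "MATH Lvl 5 (4-Shot)": 10.95,
    "GPQA (0-shot)": 8.84,
    "MuSR (0-shot)": 13.20,
    "MMLU-PRO (5-shot)": 28.62,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(round(average, 2))  # 19.56
```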