mistral-nemo-gutades-12B

This model is nbeerbower/mistral-nemo-bophades-12B fine-tuned on the jondurbin/gutenberg-dpo-v0.1 dataset.

Method

Fine-tuned with ORPO for 3 epochs on an RTX 3090.

Reference guide: "Fine-tune Llama 3 with ORPO"
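For readers unfamiliar with ORPO, the sketch below illustrates the shape of its objective for a single preference pair: a supervised term on the chosen response plus a weighted odds-ratio term that pushes the chosen response above the rejected one. It is a minimal illustration in plain Python, not the training code used for this model. The function names, the weight `lam`, and its default of 0.1 are assumptions made for the example, and the inputs are assumed to be length-normalized sequence log-likelihoods.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp), where avg_logp < 0."""
    # log1p(-exp(x)) evaluates log(1 - p) without forming 1 - p directly.
    return avg_logp - math.log1p(-math.exp(avg_logp))


def softplus(z: float) -> float:
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def orpo_loss(chosen_logp: float, rejected_logp: float, lam: float = 0.1) -> float:
    """Illustrative ORPO loss for one (chosen, rejected) pair.

    chosen_logp / rejected_logp: average per-token log-likelihoods (assumed inputs).
    lam: weight of the odds-ratio term (hypothetical default for this sketch).
    """
    # Supervised term: negative log-likelihood of the chosen response.
    sft_term = -chosen_logp
    # Odds-ratio term: -log(sigmoid(log-odds ratio)) == softplus(-log-odds ratio).
    log_odds_ratio = log_odds(chosen_logp) - log_odds(rejected_logp)
    or_term = softplus(-log_odds_ratio)
    return sft_term + lam * or_term


if __name__ == "__main__":
    # The loss is lower when the chosen response is the more likely one.
    print(orpo_loss(chosen_logp=-0.5, rejected_logp=-2.0))
    print(orpo_loss(chosen_logp=-2.0, rejected_logp=-0.5))
```

In practice these log-likelihoods would come from the model being trained, and a training library would handle batching and masking; the sketch only shows how the two terms combine.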

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	20.76
IFEval (0-Shot)	34.25
BBH (3-Shot)	34.57
MATH Lvl 5 (4-Shot)	9.89
GPQA (0-shot)	8.72
MuSR (0-shot)	8.67
MMLU-PRO (5-shot)	28.45
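The reported average is consistent with a plain arithmetic mean of the six benchmark scores in the table. The short check below reproduces it; the dictionary simply copies the values from the table, and the variable names are chosen for this example.

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 34.25,
    "BBH (3-Shot)": 34.57,
    "MATH Lvl 5 (4-Shot)": 9.89,
    "GPQA (0-shot)": 8.72,
    "MuSR (0-shot)": 8.67,
    "MMLU-PRO (5-shot)": 28.45,
}

# Unweighted mean over the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 20.76
```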

Format: Safetensors
Model size: 12.2B params
Tensor type: BF16
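As a rough guide to hardware requirements, the parameter count and tensor type give a lower bound on the memory needed just to hold the weights. The sketch below is a back-of-the-envelope estimate only: it ignores activations, the KV cache, and framework overhead, and the helper name and the dtype table are assumptions made for this example.

```python
# Bytes used per parameter for common tensor types.
BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4}

# Parameter count as listed on the model page.
PARAM_COUNT = 12.2e9


def weight_memory_gb(param_count: float, dtype: str) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return param_count * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    print(f"BF16 weights: ~{weight_memory_gb(PARAM_COUNT, 'bf16'):.1f} GB")
```

At 2 bytes per parameter, the BF16 weights alone come to roughly 24.4 GB before any runtime overhead.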

Model tree for nbeerbower/mistral-nemo-gutades-12B

Base model: mistralai/Mistral-Nemo-Base-2407
Finetuned: mistralai/Mistral-Nemo-Instruct-2407
Finetuned: nbeerbower/mistral-nemo-bophades-12B
Finetuned: this model

Dataset used to train nbeerbower/mistral-nemo-gutades-12B: jondurbin/gutenberg-dpo-v0.1

Evaluation results

Source: Open LLM Leaderboard

Benchmark	Metric	Value
IFEval (0-Shot)	strict accuracy	34.250
BBH (3-Shot)	normalized accuracy	34.570
MATH Lvl 5 (4-Shot)	exact match	9.890
GPQA (0-shot)	acc_norm	8.720
MuSR (0-shot)	acc_norm	8.670
MMLU-PRO (5-shot, test set)	accuracy	28.450
