nbeerbower/Gemma2-Gutenberg-Doppel-9B


Gemma2-Gutenberg-Doppel-9B

UCLA-AGI/Gemma-2-9B-It-SPPO-Iter3 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.

Method

Finetuned with ORPO on 2x A40 GPUs for 3 epochs.
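For readers unfamiliar with ORPO, the following is a minimal sketch of the per-example objective as it is usually described: a supervised loss on the chosen response plus a weighted odds-ratio penalty. It is not the training code used for this model. The function names, the `lam` weight of 0.1, and the idea of passing in pre-computed average per-token log-probabilities are all assumptions made for illustration.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    avg_logp is the average per-token log-probability of a response,
    so it must be strictly negative.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    """Illustrative ORPO loss for one (chosen, rejected) pair.

    lam is a hypothetical weighting of the odds-ratio term.
    """
    # Supervised term: negative log-likelihood of the chosen response.
    sft_term = -chosen_avg_logp
    # Odds-ratio term: -log(sigmoid(z)) written as log(1 + exp(-z)).
    z = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)
    odds_ratio_term = math.log1p(math.exp(-z))
    return sft_term + lam * odds_ratio_term


# The loss is lower when the chosen response is more likely than the rejected one.
good = orpo_loss(chosen_avg_logp=-0.5, rejected_avg_logp=-2.0)
bad = orpo_loss(chosen_avg_logp=-2.0, rejected_avg_logp=-0.5)
print(good < bad)
```

In practice a training library would compute the log-probabilities from model logits over batches; the sketch only shows how the two terms combine.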

Open LLM Leaderboard Evaluation Results

Detailed per-benchmark results are available through the Open LLM Leaderboard.

Metric	Value
Avg.	29.82
IFEval (0-Shot)	71.71
BBH (3-Shot)	41.08
MATH Lvl 5 (4-Shot)	3.47
GPQA (0-shot)	10.63
MuSR (0-shot)	17.30
MMLU-PRO (5-shot)	34.75
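As a quick check on the table, the reported average appears to be the unweighted mean of the six benchmark scores. The sketch below assumes that interpretation and recomputes it from the values listed above; the variable names are illustrative.

```python
# Benchmark scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 71.71,
    "BBH (3-Shot)": 41.08,
    "MATH Lvl 5 (4-Shot)": 3.47,
    "GPQA (0-shot)": 10.63,
    "MuSR (0-shot)": 17.30,
    "MMLU-PRO (5-shot)": 34.75,
}

# Unweighted mean across the six benchmarks (assumed averaging scheme).
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 29.82
```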


Model size: 9.24B params
Tensor type: BF16
Format: Safetensors


Evaluation results

Source: Open LLM Leaderboard

Benchmark	Metric	Value
IFEval (0-Shot)	strict accuracy	71.710
BBH (3-Shot)	normalized accuracy	41.080
MATH Lvl 5 (4-Shot)	exact match	3.470
GPQA (0-shot)	acc_norm	10.630
MuSR (0-shot)	acc_norm	17.300
MMLU-PRO (5-shot), test set	accuracy	34.750
