Distil Gemma 2 2b

This model is a gemma 2 2b model distilled from google/gemma-2-9b-it and finetuned on the tome.

image/webp

Prompt Template

ChatML

<|im_start|>system
{system}<|im_end|>
<|im_start|>user
{user}<|im_end|>
<|im_start|>assistant

Training Information

This model trained on 8x Nvidia H100 NVL for the equivalent of 120 GPU hours.

  • Loss Achieved: 0.27
  • Epochs: 3

Checkpoints are available in the repo to continue training

Evals

IN PROGRESS

Downloads last month
2
Safetensors
Model size
3.2B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for macadeliccc/distil-gemma-2-2b

Base model

google/gemma-2-2b
Finetuned
(514)
this model