This is another training run of SmolLlamix-8x101M with slightly different hyperparameters. Just testing to see how it holds up against the first run.

Downloads last month
1,082
Safetensors
Model size
399M params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for chargoddard/SmolLlamix-8x101M-take2

Quantizations
1 model

Dataset used to train chargoddard/SmolLlamix-8x101M-take2