Zero-Mistral-Small-3.1-24B-Instruct-2503-beta

Zero-Mistral-Small-3.1 is an improved TEXT-ONLY version of mistralai/Mistral-Small-3.1-24B-Instruct-2503 based on NO-VISION anthracite-core/Mistral-Small-3.1-24B-Instruct-2503-HF, primarily adapted for Russian and English languages. The training involved SFT stage on GrandMaster-PRO-MAX dataset.

This is a beta version. Benchmarks and some more fine-tuning coming soon.

Current status:

Trained with lm_head
Train loss: 0.564200
Eval loss: 0.638504
Downloads last month
9
Safetensors
Model size
23.6B params
Tensor type
BF16
·
Inference Providers NEW
The selected billing account doesn't have any compatible Inference Provider enabled for this model. Settings

Model tree for ZeroAgency/Zero-Mistral-Small-3.1-24B-Instruct-2503-beta

Dataset used to train ZeroAgency/Zero-Mistral-Small-3.1-24B-Instruct-2503-beta