Mistral Small Reasoning GGUF

Original model: Mistral-Small-24B-Instruct-2501-reasoning

Model creator: yentinglin

This model is a fine-tuned version of mistralai/Mistral-Small-24B-Instruct-2501, specifically optimized for mathematical reasoning tasks. It has been fine-tuned on datasets including OpenR1-Math-220k, and s1K-1.1, aiming to enhance its reasoning capabilities.

This repo contains GGUF format model files for Yen-Ting Lin’s Mistral Small Reasoning.

What is GGUF?

GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st 2023.

Converted with llama.cpp build 4735 (revision 73e2ed3), using autogguf-rs.

Prompt template: Mistral Instruct (New)


[SYSTEM_PROMPT]{{system_message}}[/SYSTEM_PROMPT]

[INST]{{prompt}}[/INST]
{{assistant_message}}

Download & run with cnvrs on iPhone, iPad, and Mac!

cnvrs.ai

cnvrs is the best app for private, local AI on your device:


Original Model Evaluation

The evaluation code is available at Hugging Face Open-R1. Note that I have updated the AIME 25 dataset to the full set, available at AIME 2025.

Our results below are averaged over multiple runs. See our eval details here.

Pass@1 # Params MATH-500 AIME 2025 AIME 2024 GPQA Diamond
Mistral-24B-Reasoning (Ours) 24B 95.0 53.33 66.67 62.02
Mistral-24B-Instruct 24B 70.6 - - 45.3
s1.1-32B 32B 93.2 40.0 56.7 61.62
LIMO 32B 94.8 36.67 57.1 59.09
DeepSeek-R1-Distill-Llama-70B 70B 94.5 46.67 70.0 65.2
DeepSeek-R1-Distill-Qwen-32B 32B 94.3 60.0 72.6 62.1
DeepSeek-R1 671B 97.3 70.0 72.6 71.5
o1 - 96.4 79.0 - 75.7
o3-mini (high) - 97.9 86.5 - 77.2
o3-mini (medium) - 97.3 76.5 - 74.9
Downloads last month
2,925
GGUF
Model size
23.6B params
Architecture
llama

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for brittlewis12/Mistral-Small-24B-Instruct-2501-reasoning-GGUF

Datasets used to train brittlewis12/Mistral-Small-24B-Instruct-2501-reasoning-GGUF