Mistral Small Reasoning GGUF
Original model: Mistral-Small-24B-Instruct-2501-reasoning
Model creator: yentinglin
This model is a fine-tuned version of mistralai/Mistral-Small-24B-Instruct-2501, optimized for mathematical reasoning tasks. It was fine-tuned on datasets including OpenR1-Math-220k and s1K-1.1 to enhance its reasoning capabilities.
This repo contains GGUF format model files for Yen-Ting Lin’s Mistral Small Reasoning.
What is GGUF?
GGUF is a file format for representing AI models. It is the third version of the format, introduced by the llama.cpp team on August 21st, 2023.
Converted with llama.cpp build 4735 (revision 73e2ed3), using autogguf-rs.
Prompt template: Mistral Instruct (New)
[SYSTEM_PROMPT]{{system_message}}[/SYSTEM_PROMPT]
[INST]{{prompt}}[/INST]
{{assistant_message}}
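As a rough illustration of how the template above can be filled in, here is a minimal Python sketch. The function name `format_mistral_prompt` and the `sep` parameter are hypothetical, not part of any library. Whether the line breaks shown in the template are literal characters in the prompt is an assumption here; the chat template embedded in the GGUF metadata is the authoritative source, and most runtimes apply it (and any beginning-of-sequence token) for you.

```python
def format_mistral_prompt(prompt: str, system_message: str = "", sep: str = "\n") -> str:
    """Fill the Mistral Instruct (New) template shown above.

    Assumption: `sep` mirrors the line breaks as displayed in this card;
    pass sep="" if your runtime expects the segments back to back.
    """
    parts = []
    if system_message:
        # The system block is omitted entirely when no system message is given.
        parts.append(f"[SYSTEM_PROMPT]{system_message}[/SYSTEM_PROMPT]")
    parts.append(f"[INST]{prompt}[/INST]")
    # The assistant message slot is left empty: the model generates it.
    return sep.join(parts)


text = format_mistral_prompt("What is 12 * 12?", system_message="Think step by step.")
print(text)
```

For multi-turn chats, the same pattern would repeat one `[INST]...[/INST]` block per user turn, each followed by the corresponding assistant reply.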
Download & run with cnvrs on iPhone, iPad, and Mac!
cnvrs is the best app for private, local AI on your device:
- create & save Characters with custom system prompts & temperature settings
- download and experiment with any GGUF model you can find on HuggingFace!
- or, use an API key with the chat completions-compatible model provider of your choice -- ChatGPT, Claude, Gemini, DeepSeek, & more!
- make it your own with custom Theme colors
- powered by Metal ⚡️ & llama.cpp, with haptics during response streaming!
- try it out yourself today, on TestFlight!
- if you already have the app, download Mistral Small Reasoning now!
- cnvrsai:///models/search/hf?id=brittlewis12/Mistral-Small-24B-Instruct-2501-reasoning-GGUF
- follow cnvrs on Twitter to stay up to date
Original Model Evaluation
The evaluation code is available at Hugging Face Open-R1. Note that I have updated the AIME 2025 dataset to the full set, available at AIME 2025.
Our results below are averaged over multiple runs. See our eval details here.
Pass@1 | # Params | MATH-500 | AIME 2025 | AIME 2024 | GPQA Diamond |
---|---|---|---|---|---|
Mistral-24B-Reasoning (Ours) | 24B | 95.0 | 53.33 | 66.67 | 62.02 |
Mistral-24B-Instruct | 24B | 70.6 | - | - | 45.3 |
s1.1-32B | 32B | 93.2 | 40.0 | 56.7 | 61.62 |
LIMO | 32B | 94.8 | 36.67 | 57.1 | 59.09 |
DeepSeek-R1-Distill-Llama-70B | 70B | 94.5 | 46.67 | 70.0 | 65.2 |
DeepSeek-R1-Distill-Qwen-32B | 32B | 94.3 | 60.0 | 72.6 | 62.1 |
DeepSeek-R1 | 671B | 97.3 | 70.0 | 72.6 | 71.5 |
o1 | - | 96.4 | 79.0 | - | 75.7 |
o3-mini (high) | - | 97.9 | 86.5 | - | 77.2 |
o3-mini (medium) | - | 97.3 | 76.5 | - | 74.9 |
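To make the "averaged over multiple runs" Pass@1 figures concrete, the following sketch shows one plausible way such a number can be computed. It is not the author's evaluation code (that lives in Open-R1); the function name `pass_at_1` and the sample data are hypothetical, and it assumes each run produces one answer per problem that is graded as correct or incorrect.

```python
from statistics import mean


def pass_at_1(run_results: list[list[bool]]) -> float:
    """Percentage of problems solved on the first attempt, averaged over runs.

    Each inner list holds one run's per-problem outcomes (True = correct).
    """
    per_run = [100.0 * sum(run) / len(run) for run in run_results]
    return mean(per_run)


# Hypothetical data: 2 runs over the same 4 problems.
runs = [
    [True, False, True, True],   # 3 of 4 correct -> 75.0
    [True, True, False, False],  # 2 of 4 correct -> 50.0
]
score = pass_at_1(runs)
print(f"{score:.2f}")
```

On a 30-problem benchmark such as AIME, each problem is worth about 3.33 points under this scheme, which is consistent with table entries like 53.33 and 66.67.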