Quantized version of yentinglin/Mistral-Small-24B-Instruct-2501-reasoning. Tested to work with llama.cpp and LM Studio

Downloads last month
1
GGUF
Model size
23.6B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support