Text-Only EXL2 Quant of mistralai/Mistral-Small-3.1-24B-Instruct-2503
The following changes were made:
The timestamp at the beginning of the system prompt was causing problems, so I removed it from tokenizer_config.json.
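For anyone who wants to make the same kind of change to another copy of the model, here is a minimal sketch. It assumes the chat template is stored as a plain string under the `chat_template` key of tokenizer_config.json, which is the usual Hugging Face layout. The function name `remove_from_chat_template` is made up for this example, and the exact text to remove is not reproduced here, so it is passed in as an argument.

```python
import json
from pathlib import Path


def remove_from_chat_template(config_path: str, snippet: str) -> bool:
    """Delete every occurrence of `snippet` from the chat template.

    Returns True if the file was changed, False if the snippet was not
    found or the template is not stored as a plain string.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))

    # Assumption: the template lives under "chat_template" as one string.
    template = config.get("chat_template")
    if not isinstance(template, str) or snippet not in template:
        return False

    config["chat_template"] = template.replace(snippet, "")
    path.write_text(
        json.dumps(config, indent=2, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return True
```

Call it with the path to your local tokenizer_config.json and the exact timestamp text copied from the template. Back up the file first, and check afterwards that nothing else in the template still refers to the removed text.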
NOTE: Tensor parallelism is not implemented in exllamav2 for either mistralai/Mistral-Small-3.1-24B-Instruct-2503 or mistralai/Mistral-Small-24B-Instruct-2501.
Base model
mistralai/Mistral-Small-3.1-24B-Base-2503