Official AQLM quantization of CohereForAI/c4ai-command-r-plus.

For this quantization, we used 1 codebook of 16 bits (each group of 8 weights is replaced by a single 16-bit code, i.e. roughly 2 bits per weight).

Results:

| Model | Quantization | MMLU (5-shot) | Model size, GB |
|-------|--------------|---------------|----------------|
| CohereForAI/c4ai-command-r-plus | None | 0.7425 | 208 |
| | 1x16 | 0.6795 | 31.9 |
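
A minimal loading sketch, assuming `aqlm[gpu]`, `accelerate`, and a `transformers` release with AQLM support (>= 4.38) are installed:

```python
# Minimal sketch: load the AQLM-quantized checkpoint with transformers.
# Assumes `pip install aqlm[gpu] accelerate transformers>=4.38`.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ISTA-DASLab/c4ai-command-r-plus-AQLM-2Bit-1x16"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # keep the stored FP16/INT16 tensor types
    device_map="auto",    # spread layers across available GPUs via accelerate
)

inputs = tokenizer("Hello, my name is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```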