The official prequantized EfficientQAT models.
-
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128
Text Generation • Updated • 5 -
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64
Text Generation • Updated • 3 -
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128
Text Generation • Updated • 7 -
ChenMnZ/Llama-3-8b-EfficientQAT-w4g128
Text Generation • Updated • 11