About the w8a8 method
#1, opened by a-r-c
Which quantization method did you use?
GPTQ Int8 for weights.
Is this padded for multi-GPU use?