How about int8 quantization?

#3
by traphix - opened

Looking forward to int8 w8a8 quantization

It's good to older GPUs, such as A100

Sign up or log in to comment