Is the GPU not used during 8-bit quantization of Llama 8B Instruct, only RAM?
Hey! To use imatrix quantization and offload to the GPU, you have to check the box below the text before submitting :)
Thanks for the quick answer!