Spaces:
Running
on
A10G
Running
on
A10G
inconsistent quantization for authenticated repos? out of space or rate limit?
#109
by
5fp
- opened
I have same error I gguf 8 bit llama 3.1 8b them I want 4 bit and same problem that dont have pth file(I can 4 bit if space restart by schedule)
Closing this as it's an error for a base model, which doesn't make much sense to quantize in the first place.
reach-vb
changed discussion status to
closed