Llama-3.1-8B-Instruct-125m-4bit / gptq_model-4bit-128g.safetensors

Commit History

AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: 4bits, gr128, desc_act=False
f40990f
verified

Sumail commited on