jiyuanq
/

falcon-40b-instruct-gptq-128g-act

Text Generation

text-generation-inference

Model card Files Files and versions Community

falcon-40b-instruct quantized with GPTQ using the script in https://github.com/huggingface/text-generation-inference/pull/438

group size: 128
act order: true
nsamples: 128
dataset: wikitext2

Downloads last month: 13

Safetensors

Model size

6.53B params

Tensor type

I64

·

I32

·

F16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support