Sumail/Llama-3.1-8B-Instruct-125m-4bit

Tags: Text Generation, Transformers, llama, conversational, 4-bit precision, gptq
  • 1 contributor
History: 2 commits
Sumail
AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: 4bits, gr128, desc_act=False
f40990f verified 7 months ago
  • .gitattributes
    1.52 kB
    initial commit 7 months ago
  • config.json
    1.38 kB
    AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: 4bits, gr128, desc_act=False 7 months ago
  • gptq_model-4bit-128g.safetensors
    5.74 GB
    LFS
    AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: 4bits, gr128, desc_act=False 7 months ago
  • quantize_config.json
    349 Bytes
    AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: 4bits, gr128, desc_act=False 7 months ago
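
The commit summary above records the quantization settings (4 bits, group size 128, desc_act=False), and the weights file name repeats the first two. The contents of quantize_config.json are not shown in this listing, so the following is only a minimal sketch of how those settings could be read back from the summary string. The helper name `parse_gptq_summary` is hypothetical, and the key names `bits`, `group_size`, and `desc_act` are assumed to match AutoGPTQ's quantize config fields.

```python
import re


def parse_gptq_summary(message: str) -> dict:
    """Extract GPTQ settings from a commit summary shaped like the one in this listing.

    Hypothetical helper: the pattern is inferred from the single summary string
    shown on this page, not from any documented format.
    """
    bits = re.search(r"(\d+)bits", message)
    group = re.search(r"gr(\d+)", message)
    desc_act = re.search(r"desc_act=(True|False)", message)
    if not (bits and group and desc_act):
        raise ValueError("summary does not match the expected pattern")
    return {
        "bits": int(bits.group(1)),            # quantization bit width
        "group_size": int(group.group(1)),     # weights per quantization group
        "desc_act": desc_act.group(1) == "True",  # activation-order flag
    }


summary = (
    "AutoGPTQ model for NousResearch/Meta-Llama-3.1-8B-Instruct: "
    "4bits, gr128, desc_act=False"
)
config = parse_gptq_summary(summary)
print(config)
```

The authoritative values are whatever quantize_config.json in the repository actually contains; a parsed summary like this is best treated as a cross-check against that file.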