Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

TheBloke
/
Qwen-7B-Chat-AWQ

Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
4-bit precision
awq
Model card Files Files and versions Community
1
Qwen-7B-Chat-AWQ
Ctrl+K
Ctrl+K
  • 1 contributor
History: 5 commits
TheBloke's picture
TheBloke
Fix config.json, disabling Flash Attention
9be754f over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • LICENSE
    6.9 kB
    AWQ model commit over 1 year ago
  • NOTICE
    2.7 kB
    AWQ model commit over 1 year ago
  • README.md
    48.4 kB
    Upload README.md over 1 year ago
  • config.json
    1.28 kB
    Fix config.json, disabling Flash Attention over 1 year ago
  • configuration_qwen.py
    2.35 kB
    AWQ model commit over 1 year ago
  • cpp_kernels.py
    1.92 kB
    AWQ model commit over 1 year ago
  • generation_config.json
    249 Bytes
    AWQ model commit over 1 year ago
  • model.safetensors
    5.86 GB
    LFS
    AWQ model commit over 1 year ago
  • modeling_qwen.py
    55.8 kB
    AWQ model commit over 1 year ago
  • quant_config.json
    90 Bytes
    AWQ model commit over 1 year ago
  • qwen.tiktoken
    2.56 MB
    AWQ model commit over 1 year ago
  • qwen_generation_utils.py
    14.6 kB
    AWQ model commit over 1 year ago
  • special_tokens_map.json
    3 Bytes
    AWQ model commit over 1 year ago
  • tokenization_qwen.py
    9.62 kB
    AWQ model commit over 1 year ago
  • tokenizer_config.json
    173 Bytes
    AWQ model commit over 1 year ago