Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
chatglm2-6b-int4
like
234
Transformers
PyTorch
Chinese
English
chatglm
glm
thudm
custom_code
Inference Endpoints
4 papers
Model card
Files
Files and versions
Community
23
Train
Deploy
Use this model
main
chatglm2-6b-int4
/
quantization.py
Commit History
Update quantized gemm kernel
5579a9f
duzx16
commited on
Jul 16, 2023
Add cpu kernel
8b97bf2
duzx16
commited on
Jun 26, 2023
Init commit
8668ecb
duzx16
commited on
Jun 25, 2023