Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ISTA-DASLab
/
DeepSeek-R1-GPTQ-4b-128g-experts
like
3
Follow
IST Austria Distributed Algorithms and Systems Lab
92
Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
text-generation-inference
compressed-tensors
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
How to Only compress non-shared experts within transformer blocks?
#1 opened 17 days ago by
CobraMamba