Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
neuralmagic
's Collections
DeepSeek-R1-Distill Quantized
Granite 3.1 Quantization
Sparse-Llama-3.1-2of4
Vision Language Models Quantization
FP8 LLMs for vLLM
Llama-3.2 Quantization
Llama-3.1 Quantization
INT8 LLMs for vLLM
INT4 LLMs for vLLM
Sparse Foundational Llama 2 Models
Compression Papers
DeepSparse Sparse LLMs
Sparse Finetuning MPT
Compressed LLMs from the Community
Granite 3.1 Quantization
updated
Jan 24
Upvote
-
RedHatAI/granite-3.1-2b-instruct-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
134
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
1.73k
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
•
1B
•
Updated
May 30
•
1.7k
•
1
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
•
8B
•
Updated
May 30
•
2.63k
•
1
RedHatAI/granite-3.1-2b-instruct-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 28
•
9
RedHatAI/granite-3.1-8b-instruct-FP8-dynamic
Text Generation
•
8B
•
Updated
May 30
•
139
•
1
RedHatAI/granite-3.1-2b-base-quantized.w4a16
Text Generation
•
0.5B
•
Updated
Feb 28
•
45
RedHatAI/granite-3.1-2b-base-quantized.w8a8
Text Generation
•
3B
•
Updated
Feb 28
•
1.75k
RedHatAI/granite-3.1-8b-base-FP8-dynamic
Text Generation
•
8B
•
Updated
Feb 20
•
10
RedHatAI/granite-3.1-2b-base-FP8-dynamic
Text Generation
•
3B
•
Updated
Jan 30
•
16
RedHatAI/granite-3.1-8b-base-quantized.w4a16
Text Generation
•
1B
•
Updated
May 30
•
116
RedHatAI/granite-3.1-8b-base-quantized.w8a8
Text Generation
•
8B
•
Updated
Feb 28
•
1.74k
Upvote
-
Share collection
View history
Collection guide
Browse collections