File size: 538 Bytes
4fa4f1c
 
bd177bf
 
 
 
 
2367a14
bd177bf
928ea15
 
d1fede7
2367a14
 
4fa4f1c
 
2367a14
 
 
 
 
 
4fa4f1c
2367a14
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
---
library_name: transformers
license: apache-2.0
language:
- en
base_model:
- HuggingFaceTB/SmolVLM-Instruct
---

4bit nf4 quantized version, you can find the quantized version generation code below.


```
from transformers import BitsAndBytesConfig


nf4_config = BitsAndBytesConfig(
   load_in_4bit=True,
   bnb_4bit_quant_type="nf4",
   bnb_4bit_use_double_quant=True,
   bnb_4bit_compute_dtype=torch.bfloat16
)

model_nf4 = AutoModelForVision2Seq.from_pretrained("HuggingFaceTB/SmolVLM-Instruct", quantization_config=nf4_config)
```