Add missing quant_config.json for compatibility with vLLM backends out of the box. (#1)
Browse files- Add missing quant_config.json for compatibility with vLLM backends out of the box. (aa2a3bfa23cced5784ce861bac33972e542ceed9)
Co-authored-by: Vaclav Kosar <[email protected]>
- quant_config.json +6 -0
quant_config.json
ADDED
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"zero_point": true,
|
3 |
+
"q_group_size": 128,
|
4 |
+
"w_bit": 4,
|
5 |
+
"version": "GEMM"
|
6 |
+
}
|