mengqin1 commited on
Commit
2056a9d
·
verified ·
1 Parent(s): 0b3aad7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -34,7 +34,7 @@ This approach reduces memory usage during quantization to ~40 GB and is **entire
34
 
35
  ## Performance
36
 
37
- - The quantized models perform **similarly** to the original nf4 quantized version.
38
  - No formal benchmark or deep evaluation has been conducted.
39
 
40
  ## Files
 
34
 
35
  ## Performance
36
 
37
+ - The quantized models perform **similarly** to the Q4_0 quantized version.
38
  - No formal benchmark or deep evaluation has been conducted.
39
 
40
  ## Files