nisten committed on
Commit
357d311
1 Parent(s): 2909ebc

Update README.md

Files changed (1)
  1. README.md +9 -3
README.md CHANGED
@@ -1,3 +1,9 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ base_model: Qwen/Qwen2-7B-Instruct
+ ---
+
+ These are a whole bunch of conversions of Qwen2-7B-Instruct in an attempt to fix the reduced performance after quantizing.
+ The bf16 versions will NOT work with Apple GPUs but will work with most CPUs and newer NVIDIA cards (older ones like the 1080 series don't support bf16 inference well).
+
+ Perplexity benchmarks will come later once an automated suite is written by me or whoever; sorry, I have just been too busy, and doing those properly takes all day.