GGUF
Commit 76c7329 by Fizzarolli
1 parent: 82ae33b

Create README.md

  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
+ ---
+ datasets:
+ - Open-Orca/SlimOrca-Dedup
+ - jondurbin/airoboros-3.2
+ license: other
+ license_name: gemma-terms-of-use
+ license_link: https://ai.google.dev/gemma/terms
+ ---
+ ## These are GGUF quants of [Sappha-2b-v3](https://huggingface.co/Fizzarolli/sappha-2b-v3). The original model card is below:
+
+ # sappha-2b-v3
+ a slightly less experimental qlora instruct finetune of the gemma-2b base model. trained with unsloth.
+
+ ## benchmarks
+
+ | | gemma-2b-it | sappha-2b-v3 | dolphin-2.8-gemma-2b |
+ | ---------------------- | ----------- | ------------ | -------------------- |
+ | MMLU (five-shot) | 36.98 | **38.02** | 37.89 |
+ | HellaSwag (zero-shot) | 49.22 | **51.70** | 47.79 |
+ | PIQA (one-shot) | 75.08 | **75.46** | 71.16 |
+ | TruthfulQA (zero-shot) | **37.51** | 31.65 | 37.15 |
+
+
+ ## prompt format
+ basic chatml:
+ ```
+ <|im_start|>system
+ You are a useful and helpful AI assistant.<|im_end|>
+ <|im_start|>user
+ what are LLMs?<|im_end|>
+ <|im_start|>assistant
+ LLMs, or Large Language Models, are advanced artificial intelligence systems that can perform tasks similar to human language. They are trained on vast amounts of data and can understand and respond to human queries. LLMs are often used in various applications, such as language translation, text generation, and question answering.<|im_end|>
+ ```
+
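a minimal sketch of building a prompt in the chatml layout shown above, in case your runtime does not apply a chat template for you. the helper name `format_chatml` and its `add_generation_prompt` parameter are hypothetical (not part of the model card or any library), and the newline after each `<|im_end|>` is an assumption read off the example rather than something the card states.

```python
def format_chatml(messages, add_generation_prompt=True):
    """Render a list of {"role", "content"} dicts as a ChatML prompt string.

    Hypothetical helper: the layout follows the example in the model card,
    with one turn per <|im_start|>...<|im_end|> pair.
    """
    parts = []
    for message in messages:
        # Each turn is: <|im_start|>ROLE, newline, CONTENT, <|im_end|>, newline.
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    if add_generation_prompt:
        # Leave the assistant turn open so the model writes the reply.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


messages = [
    {"role": "system", "content": "You are a useful and helpful AI assistant."},
    {"role": "user", "content": "what are LLMs?"},
]
prompt = format_chatml(messages)
print(prompt)
```

when generating from a prompt built this way, you would typically stop on `<|im_end|>` so the model does not run on into a new turn.
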
+ ## quants
+ gguf: https://huggingface.co/Fizzarolli/sappha-2b-v3-GGUF
+
+ ## what happened to v2?
+ it was a private failure :)