AlekseiPravdin committed on
Commit
6bac1c3
1 Parent(s): 31c2972

Upload README.md with huggingface_hub

  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
+
+ ---
+ license: apache-2.0
+ tags:
+ - merge
+ - mergekit
+ - lazymergekit
+ - NousResearch/Hermes-2-Pro-Llama-3-8B
+ - shenzhi-wang/Llama3-8B-Chinese-Chat
+ ---
+
+ Quantized GGUF model Hermes-2-Pro-Llama-3-8B-Llama3-8B-Chinese-Chat-slerp-merge-gguf
+
+ Hermes-2-Pro-Llama-3-8B-Llama3-8B-Chinese-Chat-slerp-merge-gguf is a quantized model using [llama.cpp](https://github.com/ggerganov/llama.cpp) `llama-quantize`
+
+
+
+ # Hermes-2-Pro-Llama-3-8B-Llama3-8B-Chinese-Chat-slerp-merge
+
+ Hermes-2-Pro-Llama-3-8B-Llama3-8B-Chinese-Chat-slerp-merge is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
+ * [NousResearch/Hermes-2-Pro-Llama-3-8B](https://huggingface.co/NousResearch/Hermes-2-Pro-Llama-3-8B)
+ * [shenzhi-wang/Llama3-8B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat)
+
+ ## 🧩 Merge Configuration
+
+ ```yaml
+ slices:
+   - sources:
+       - model: NousResearch/Hermes-2-Pro-Llama-3-8B
+         layer_range: [0, 31]
+       - model: shenzhi-wang/Llama3-8B-Chinese-Chat
+         layer_range: [0, 31]
+ merge_method: slerp
+ base_model: NousResearch/Hermes-2-Pro-Llama-3-8B
+ parameters:
+   t:
+     - filter: self_attn
+       value: [0, 0.5, 0.3, 0.7, 1]
+     - filter: mlp
+       value: [1, 0.5, 0.7, 0.3, 0]
+     - value: 0.5
+ dtype: float16
+ ```
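
To make the configuration above easier to read, here is a minimal pure-Python sketch of what a `slerp` merge with a per-layer `t` schedule does conceptually. It is not mergekit's actual implementation. The helper names `slerp` and `layer_t` are hypothetical, and the sketch assumes that `t = 0` returns the `base_model` weights, that `t = 1` returns the other model's weights, that list-valued `t` entries are interpolated linearly across the layer stack, and that `layer_range: [0, 31]` is half-open (31 layers).

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors.

    Illustrative only: real merges operate on tensors, not Python lists.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the two vectors, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / max(norm0 * norm1, eps)
    dot = max(-1.0, min(1.0, dot))
    theta = math.acos(dot)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < 1e-6:
        # Nearly parallel vectors: fall back to plain linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / sin_theta
    s1 = math.sin(t * theta) / sin_theta
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]


def layer_t(anchors, layer_idx, num_layers):
    """Piecewise-linear interpolation of anchor values over the layer index.

    Assumption: this mirrors how a list such as [0, 0.5, 0.3, 0.7, 1]
    is spread across the layers of the merged slice.
    """
    if len(anchors) == 1 or num_layers == 1:
        return anchors[0]
    pos = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = min(int(pos), len(anchors) - 2)
    frac = pos - lo
    return (1 - frac) * anchors[lo] + frac * anchors[lo + 1]


# Schedules copied from the YAML configuration.
SELF_ATTN_T = [0, 0.5, 0.3, 0.7, 1]
MLP_T = [1, 0.5, 0.7, 0.3, 0]
NUM_LAYERS = 31  # assumption: layer_range [0, 31] is half-open

if __name__ == "__main__":
    for idx in (0, 15, 30):
        print(idx,
              round(layer_t(SELF_ATTN_T, idx, NUM_LAYERS), 3),
              round(layer_t(MLP_T, idx, NUM_LAYERS), 3))
```

Under these assumptions, the self-attention weights of the first layers stay close to the base model and drift toward the second model in the last layers, while the MLP weights follow the opposite schedule. Tensors that match neither filter use the constant `t = 0.5`.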