YOYO-AI commited on
Commit
97dd96f
·
verified ·
1 Parent(s): d933fed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -3
README.md CHANGED
@@ -1,3 +1,41 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ - zh
6
+ base_model:
7
+ - Qwen/Qwen3-30B-A3B-Thinking-2507
8
+ - Qwen/Qwen3-Coder-30B-A3B-Instruct
9
+ pipeline_tag: text-generation
10
+ tags:
11
+ - merge
12
+ ---
13
+
14
+ ## *Model Highlights:*
15
+
16
+ - ***merge method**: `linear`*
17
+
18
+ - ***Highest precision**: `dtype: float32` + `out_dtype: bfloat16`*
19
+
20
+ - ***Context length**: `262,144`*
21
+
22
+ ## *Parameter Settings*:
23
+ > [!NOTE]
24
+ > *`Temperature=0.7`, `TopP=0.8`, `TopK=20`,`MinP=0`.*
25
+ ## *Configuration*:
26
+ *The following YAML configuration was used to produce this model:*
27
+
28
+ ```yaml
29
+ models:
30
+ - model: Qwen/Qwen3-30B-A3B-Thinking-2507
31
+ parameters:
32
+ weight: 0.9
33
+ - model: Qwen/Qwen3-Coder-30B-A3B-Instruct
34
+ parameters:
35
+ weight: 0.1
36
+ merge_method: linear
37
+ tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
38
+ dtype: float32
39
+ out_dtype: bfloat16
40
+ ```
41
+