marcuscedricridia, lbourdois committed
Commit 6740c5d · verified · 1 parent: 01cfc8a

Improve language tag (#1)

- Improve language tag (f925eb5a3ecc8fae33c1f9660f3798e7d9ab71ef)


Co-authored-by: Loïck BOURDOIS <[email protected]>

Files changed (1)
  1. README.md +62 -49
README.md CHANGED
@@ -1,49 +1,62 @@
- ---
- base_model:
- - Cran-May/T.E-8.1
- - marcuscedricridia/Yell-Qwen2.5-7B-Coder
- - marcuscedricridia/Hush-Qwen2.5-7B-Preview
- - bunnycore/Blabbertron-1.1
- - Qwen/Qwen2.5-7B-Instruct
- library_name: transformers
- tags:
- - mergekit
- - merge
-
- ---
- # merge
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
- * [Cran-May/T.E-8.1](https://huggingface.co/Cran-May/T.E-8.1)
- * [marcuscedricridia/Yell-Qwen2.5-7B-Coder](https://huggingface.co/marcuscedricridia/Yell-Qwen2.5-7B-Coder)
- * [marcuscedricridia/Hush-Qwen2.5-7B-Preview](https://huggingface.co/marcuscedricridia/Hush-Qwen2.5-7B-Preview)
- * [bunnycore/Blabbertron-1.1](https://huggingface.co/bunnycore/Blabbertron-1.1)
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- merge_method: model_stock
- base_model: Qwen/Qwen2.5-7B-Instruct
- models:
- - model: marcuscedricridia/Yell-Qwen2.5-7B-Coder
- - model: Cran-May/T.E-8.1
- - model: marcuscedricridia/Hush-Qwen2.5-7B-Preview
- - model: bunnycore/Blabbertron-1.1
- dtype: bfloat16
- tokenizer_source: base
- int8_mask: true
- normalize: true
- name: Yell-Qwen2.5-7B-Stock-v1.1
-
- ```
+ ---
+ base_model:
+ - Cran-May/T.E-8.1
+ - marcuscedricridia/Yell-Qwen2.5-7B-Coder
+ - marcuscedricridia/Hush-Qwen2.5-7B-Preview
+ - bunnycore/Blabbertron-1.1
+ - Qwen/Qwen2.5-7B-Instruct
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+ # merge
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) as a base.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [Cran-May/T.E-8.1](https://huggingface.co/Cran-May/T.E-8.1)
+ * [marcuscedricridia/Yell-Qwen2.5-7B-Coder](https://huggingface.co/marcuscedricridia/Yell-Qwen2.5-7B-Coder)
+ * [marcuscedricridia/Hush-Qwen2.5-7B-Preview](https://huggingface.co/marcuscedricridia/Hush-Qwen2.5-7B-Preview)
+ * [bunnycore/Blabbertron-1.1](https://huggingface.co/bunnycore/Blabbertron-1.1)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ merge_method: model_stock
+ base_model: Qwen/Qwen2.5-7B-Instruct
+ models:
+ - model: marcuscedricridia/Yell-Qwen2.5-7B-Coder
+ - model: Cran-May/T.E-8.1
+ - model: marcuscedricridia/Hush-Qwen2.5-7B-Preview
+ - model: bunnycore/Blabbertron-1.1
+ dtype: bfloat16
+ tokenizer_source: base
+ int8_mask: true
+ normalize: true
+ name: Yell-Qwen2.5-7B-Stock-v1.1
+
+ ```
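
The only substantive change in this commit is the new `language:` list in the front matter, whose entries appear to be ISO 639-3 three-letter codes. The following is a minimal sketch, not part of the commit, of how such a list could be read back and mapped to language names. It assumes the flat, unindented list layout shown in the diff. `CARD_FRONT_MATTER`, `ISO_639_3_NAMES`, `read_language_tags`, and `describe` are hypothetical names chosen for illustration, and a real tool would more likely use a proper YAML parser.

```python
# Illustrative sketch only: all names here are hypothetical, not a Hub API.

# Front matter as it appears on the "+" side of the diff.
CARD_FRONT_MATTER = """\
---
base_model:
- Cran-May/T.E-8.1
- marcuscedricridia/Yell-Qwen2.5-7B-Coder
- marcuscedricridia/Hush-Qwen2.5-7B-Preview
- bunnycore/Blabbertron-1.1
- Qwen/Qwen2.5-7B-Instruct
library_name: transformers
tags:
- mergekit
- merge
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
---
# merge
"""

# ISO 639-3 codes used in the commit, with their English names.
ISO_639_3_NAMES = {
    "zho": "Chinese", "eng": "English", "fra": "French",
    "spa": "Spanish", "por": "Portuguese", "deu": "German",
    "ita": "Italian", "rus": "Russian", "jpn": "Japanese",
    "kor": "Korean", "vie": "Vietnamese", "tha": "Thai",
    "ara": "Arabic",
}


def read_language_tags(card_text):
    """Return the items listed under the top-level `language:` key.

    Assumes the simple layout from the diff: a `---` delimited block
    with list items written as `- item` directly under their key.
    """
    lines = card_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return []  # no front matter block at the top of the card
    tags = []
    in_language = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # end of front matter
        if line.startswith("language:"):
            in_language = True
        elif in_language and stripped.startswith("- "):
            tags.append(stripped[2:].strip())
        elif stripped:
            in_language = False  # a different key or list has started
    return tags


def describe(tags):
    """Pair each code with its name, or "unknown" if it is not in the table."""
    return [(tag, ISO_639_3_NAMES.get(tag, "unknown")) for tag in tags]


if __name__ == "__main__":
    for code, name in describe(read_language_tags(CARD_FRONT_MATTER)):
        print(f"{code}: {name}")
```

A check like this could flag a mistyped code (it would map to "unknown") before the card is pushed, though the name table above only covers the thirteen codes that appear in this commit.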