sleepdeprived3 committed
Commit 298c9bd · verified · 1 Parent(s): b5dd862

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +43 -0
README.md ADDED
@@ -0,0 +1,43 @@
+ ---
+ base_model:
+ - EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2
+ - nbeerbower/EVA-Gutenberg3-Qwen2.5-32B
+ - nbeerbower/DeepSeek-R1-Qwen-lorablated-32B
+ - nbeerbower/Rombos-Qwen2.5-32B-lorablated
+ library_name: transformers
+ tags:
+ - mergekit
+ - merge
+ license: apache-2.0
+ ---
+ # EVA-Rombos1-Qwen2.5-32B
+
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with nbeerbower/DeepSeek-R1-Qwen-lorablated-32B as the base model.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * nbeerbower/Rombos-Qwen2.5-32B-lorablated
+ * EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2
+ * nbeerbower/EVA-Gutenberg3-Qwen2.5-32B
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+ - model: nbeerbower/Rombos-Qwen2.5-32B-lorablated
+ - model: EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2
+ - model: nbeerbower/EVA-Gutenberg3-Qwen2.5-32B
+ merge_method: model_stock
+ base_model: nbeerbower/DeepSeek-R1-Qwen-lorablated-32B
+ dtype: bfloat16
+
+
+ ```
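
Reviewer note: for readers unfamiliar with `merge_method: model_stock`, the following is a minimal sketch of the interpolation described in the linked Model Stock paper, applied to a single flattened weight tensor. It is not mergekit's code. The function name `model_stock_merge` and the plain-list representation are assumptions made for illustration, and mergekit's real implementation works on tensors and may differ in details. With the configuration above, `base` would correspond to one tensor from the base model and `finetuned` to the same tensor from each of the three listed models.

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Merge one flattened weight tensor with the Model Stock rule.

    base:      list of floats (weights of the base model)
    finetuned: list of lists of floats (same tensor from each merged model)
    Returns (merged_weights, t), where t is the interpolation ratio.
    Hypothetical helper for illustration only.
    """
    n = len(finetuned)
    if n < 2:
        raise ValueError("need at least two fine-tuned models")

    # Offsets ("task vectors") of each fine-tuned tensor from the base.
    offsets = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(x * y for x, y in zip(u, v))
        norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
        return dot / norm if norm > 0 else 0.0

    # Estimate cos(theta) as the mean pairwise cosine between offsets.
    pairs = list(combinations(offsets, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio: t = N cos(theta) / (1 + (N - 1) cos(theta)).
    denom = 1 + (n - 1) * cos_theta
    if abs(denom) < 1e-12:
        raise ValueError("degenerate angle between offsets")
    t = n * cos_theta / denom

    # Average the fine-tuned weights, then interpolate toward the base.
    avg = [sum(col) / n for col in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(avg, base)]
    return merged, t
```

When the offsets are mutually orthogonal the ratio `t` is 0 and the result falls back to the base weights; when all offsets point the same way `t` approaches 1 and the result is the plain average of the merged models.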