YOYO-AI
/

Qwen3-14B-YOYO

Text Generation

text-generation-inference

Model card Files Files and versions

YOYO-AI commited on May 20

Commit

1ea3f32

·

verified ·

1 Parent(s): 98692a7

Update README.md

Files changed (1) hide show

README.md +52 -3

README.md CHANGED Viewed

@@ -1,3 +1,52 @@
----
-license: apache-2.0
----

+---
+base_model:
+- Qwen/Qwen3-14B
+- Qwen/Qwen3-14B-Base
+library_name: transformers
+tags:
+- mergekit
+- merge
+license: apache-2.0
+language:
+- en
+- zh
+pipeline_tag: text-generation
+---
+# Qwen3-14B-YOYO
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [DELLA](https://arxiv.org/abs/2406.11617) merge method using [Qwen/Qwen3-14B-Base](https://huggingface.co/Qwen/Qwen3-14B-Base) as a base.
+### Models Merged
+The following models were included in the merge:
+* [Qwen/Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B)
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+models:
+  - model: Qwen/Qwen3-14B
+    parameters:
+      density: 0.5
+      weight: 1
+      lambda: 0.9
+merge_method: della
+base_model: Qwen/Qwen3-14B-Base
+parameters:
+  density: 1
+  weight: 1
+  lambda: 0.9
+  normalize: true
+  int8_mask: true
+dtype: bfloat16
+chat_template: "chatml"
+tokenizer_source: Qwen/Qwen3-14B
+```