YOYO-AI
/

Qwen3-30B-A3B-CoderThinking-YOYO-linear

YOYO-AI committed on Aug 5

Commit 97dd96f · verified · 1 parent: d933fed

Update README.md

Files changed (1)

README.md +41 -3

@@ -1,3 +1,41 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+language:
+- en
+- zh
+base_model:
+- Qwen/Qwen3-30B-A3B-Thinking-2507
+- Qwen/Qwen3-Coder-30B-A3B-Instruct
+pipeline_tag: text-generation
+tags:
+- merge
+---
+## *Model Highlights*:
+- ***Merge method**: `linear`*
+- ***Highest precision**: `dtype: float32` + `out_dtype: bfloat16`*
+- ***Context length**: `262,144`*
+## *Parameter Settings*:
+> [!NOTE]
+> *`Temperature=0.7`, `TopP=0.8`, `TopK=20`, `MinP=0`.*
+## *Configuration*:
+*The following YAML configuration was used to produce this model:*
+```yaml
+models:
+  - model: Qwen/Qwen3-30B-A3B-Thinking-2507
+    parameters:
+      weight: 0.9
+  - model: Qwen/Qwen3-Coder-30B-A3B-Instruct
+    parameters:
+      weight: 0.1
+merge_method: linear
+tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
+dtype: float32
+out_dtype: bfloat16
+```
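
For readers unfamiliar with the `linear` merge method named in the configuration above, the following is a minimal sketch of the idea: each parameter of the merged model is a weighted average of the corresponding parameters in the source models. It is an illustration under stated assumptions, not the merge tool's actual implementation. The function `linear_merge`, the `normalize` flag (dividing by the sum of the weights), and the toy checkpoints are hypothetical names introduced here; plain Python lists stand in for tensors, and the `float32`/`bfloat16` dtype handling from the configuration is omitted.

```python
from typing import Dict, List, Sequence

Checkpoint = Dict[str, List[float]]  # parameter name -> flat list of values


def linear_merge(
    checkpoints: Sequence[Checkpoint],
    weights: Sequence[float],
    normalize: bool = True,
) -> Checkpoint:
    """Weighted average of same-shaped parameters (hypothetical helper)."""
    if not checkpoints or len(checkpoints) != len(weights):
        raise ValueError("need one weight per checkpoint")
    # Assumption: weights are rescaled to sum to 1 when normalize is True.
    total = sum(weights) if normalize else 1.0
    if total == 0:
        raise ValueError("weights must not sum to zero")

    merged: Checkpoint = {}
    for name, reference in checkpoints[0].items():
        acc = [0.0] * len(reference)
        for ckpt, w in zip(checkpoints, weights):
            values = ckpt[name]
            if len(values) != len(reference):
                raise ValueError(f"shape mismatch for {name!r}")
            # Accumulate this checkpoint's weighted contribution.
            for i, v in enumerate(values):
                acc[i] += w * v
        merged[name] = [a / total for a in acc]
    return merged


# Toy stand-ins for the two source models, using the weights from the config.
thinking = {"layer.weight": [1.0, 2.0]}
coder = {"layer.weight": [3.0, 4.0]}
merged = linear_merge([thinking, coder], weights=[0.9, 0.1])
print(merged)  # values close to 1.2 and 2.2
```

With weights of 0.9 and 0.1, every merged parameter stays close to the Thinking model's value and is shifted only slightly toward the Coder model's value.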