metadata
license: apache-2.0
language:
- en
- zh
base_model:
- Qwen/Qwen3-30B-A3B-Thinking-2507
- Qwen/Qwen3-Coder-30B-A3B-Instruct
pipeline_tag: text-generation
tags:
- merge
Model Highlights:
Merge method: linear
Highest precision: dtype: float32 + out_dtype: bfloat16
Context length: 262,144
Parameter Settings: Temperature=0.6, TopP=0.95, TopK=20, MinP=0.
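To make the sampling settings above concrete, here is a minimal, library-free sketch of how temperature, top-k, top-p, and min-p might narrow a next-token distribution. The function name `filter_probs` and the toy logits are hypothetical, and the order of the filtering steps is an assumption: real inference engines differ in ordering and in when they renormalise.

```python
import math

def filter_probs(logits, temperature=0.6, top_p=0.95, top_k=20, min_p=0.0):
    """Return {token_id: probability} for the tokens that survive filtering.

    Illustrative only: the step order here is an assumption, not the
    behaviour of any specific inference engine.
    """
    # Temperature scaling followed by a numerically stable softmax.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Token ids ranked from most to least probable.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most probable tokens.
    keep = order[:top_k] if top_k > 0 else order

    # Top-p: keep the shortest prefix whose cumulative mass reaches top_p.
    nucleus, cumulative = [], 0.0
    for i in keep:
        nucleus.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Min-p: drop tokens below min_p times the top probability.
    # With min_p=0 this step removes nothing.
    threshold = min_p * probs[order[0]]
    nucleus = [i for i in nucleus if probs[i] >= threshold]

    # Renormalise so the surviving probabilities sum to 1.
    mass = sum(probs[i] for i in nucleus)
    return {i: probs[i] / mass for i in nucleus}

# Toy example with four candidate tokens.
result = filter_probs([2.0, 1.0, 0.0, -1.0])
```

With the settings listed above, the two most likely toy tokens already cover more than 95% of the probability mass, so the other two are dropped before sampling.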
Configuration:
The following YAML configuration was used to produce this model:
models:
  - model: Qwen/Qwen3-30B-A3B-Thinking-2507
    parameters:
      weight: 0.9
  - model: Qwen/Qwen3-Coder-30B-A3B-Instruct
    parameters:
      weight: 0.1
merge_method: linear
tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
dtype: float32
out_dtype: bfloat16
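As a rough illustration of what this configuration asks for, the sketch below computes a weighted average of two toy parameter lists at higher precision and only then rounds the result to bfloat16. It is not the merge tool's implementation: `linear_merge` and `to_bfloat16` are hypothetical helpers, Python floats stand in for the float32 working dtype, and the two short lists stand in for real model tensors.

```python
import struct

def to_bfloat16(x: float) -> float:
    """Round a finite float to the nearest bfloat16 value (ties to even).

    bfloat16 keeps the top 16 bits of the float32 bit pattern.
    NaN handling is omitted in this sketch.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Bias that implements round-to-nearest-even on the discarded low bits.
    rounding = 0x7FFF + ((bits >> 16) & 1)
    bits = (((bits + rounding) >> 16) << 16) & 0xFFFFFFFF
    return struct.unpack("<f", struct.pack("<I", bits))[0]

def linear_merge(tensors, weights):
    """Weighted average of equally shaped flat tensors, rounded at the end."""
    total = sum(weights)
    merged = []
    for values in zip(*tensors):
        # Accumulate at higher precision first (the role of dtype: float32).
        acc = sum(w * v for w, v in zip(weights, values)) / total
        # Cast once on output (the role of out_dtype: bfloat16).
        merged.append(to_bfloat16(acc))
    return merged

# Toy stand-ins for one tensor from each source model.
thinking = [1.0, 2.0]
coder = [3.0, -2.0]
merged = linear_merge([thinking, coder], [0.9, 0.1])
```

Averaging before rounding means the rounding error is introduced once, on the final values, rather than on each input and again on the result.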