|
--- |
|
license: apache-2.0 |
|
language: |
|
- en |
|
- zh |
|
base_model: |
|
- Qwen/Qwen3-30B-A3B-Thinking-2507 |
|
- Qwen/Qwen3-Coder-30B-A3B-Instruct |
|
pipeline_tag: text-generation |
|
tags: |
|
- merge |
|
--- |
|
|
|
## *Model Highlights:* |
|
|
|
- ***merge method**: `linear`* |
|
|
|
- ***Highest precision**: `dtype: float32` + `out_dtype: bfloat16`* |
|
|
|
- ***Context length**: `262,144`* |
|
|
|
## *Parameter Settings*: |
|
> [!NOTE] |
|
> *`Temperature=0.6`, `TopP=0.95`, `TopK=20`,`MinP=0`.* |
|
## *Configuration*: |
|
*The following YAML configuration was used to produce this model:* |
|
|
|
```yaml |
|
models: |
|
- model: Qwen/Qwen3-30B-A3B-Thinking-2507 |
|
parameters: |
|
weight: 0.9 |
|
- model: Qwen/Qwen3-Coder-30B-A3B-Instruct |
|
parameters: |
|
weight: 0.1 |
|
merge_method: linear |
|
tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507 |
|
dtype: float32 |
|
out_dtype: bfloat16 |
|
``` |
|
|
|
|