YOYO-AI's picture
Update README.md
772deeb verified
|
raw
history blame
843 Bytes
---
license: apache-2.0
language:
- en
- zh
base_model:
- Qwen/Qwen3-30B-A3B-Thinking-2507
- Qwen/Qwen3-30B-A3B-Instruct-2507
pipeline_tag: text-generation
tags:
- merge
---
## *Model Highlights:*
- ***merge method**: `nuslerp`*
- ***Highest precision**: `dtype: float32` + `out_dtype: bfloat16`*
- ***Context length**: `262,144`*
## *Parameter Settings*:
> [!NOTE]
> *`Temperature=0.7`, `TopP=0.8`, `TopK=20`,`MinP=0`.*
## *Configuration*:
*The following YAML configuration was used to produce this model:*
```yaml
models:
- model: Qwen/Qwen3-30B-A3B-Thinking-2507
parameters:
weight: 1
- model: Qwen/Qwen3-30B-A3B-Instruct-2507
parameters:
weight: 1
merge_method: nuslerp
tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
parameters:
normalize: true
int8_mask: false
dtype: float32
out_dtype: bfloat16
```