YOYO-AI's picture
Update README.md
d9aef9b verified
metadata
license: apache-2.0
language:
  - en
  - zh
base_model:
  - Qwen/Qwen3-30B-A3B-Thinking-2507
  - Qwen/Qwen3-Coder-30B-A3B-Instruct
pipeline_tag: text-generation
tags:
  - merge

Model Highlights:

  • merge method: linear

  • Highest precision: dtype: float32 + out_dtype: bfloat16

  • Context length: 262,144

Parameter Settings:

Temperature=0.6, TopP=0.95, TopK=20,MinP=0.

Configuration:

The following YAML configuration was used to produce this model:

models:
  - model: Qwen/Qwen3-30B-A3B-Thinking-2507
    parameters:
      weight: 0.9
  - model: Qwen/Qwen3-Coder-30B-A3B-Instruct
    parameters:
      weight: 0.1
merge_method: linear
tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
dtype: float32
out_dtype: bfloat16