Model Highlights:

  • merge method: nuslerp

  • Highest precision: dtype: float32 + out_dtype: bfloat16

  • Context length: 262,144

Parameter Settings:

Temperature=0.7, TopP=0.8, TopK=20,MinP=0.

Configuration:

The following YAML configuration was used to produce this model:

models:
  - model: Qwen/Qwen3-30B-A3B-Thinking-2507
    parameters:
      weight: 1
  - model: Qwen/Qwen3-30B-A3B-Instruct-2507
    parameters:
      weight: 1
merge_method: nuslerp
tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
parameters:
  normalize: true
  int8_mask: false
dtype: float32
out_dtype: bfloat16
Downloads last month
33
Safetensors
Model size
30.5B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for YOYO-AI/Qwen3-30B-A3B-Mixture-2507