YOYO-AI
/

Qwen3-30B-A3B-CoderThinking-YOYO-linear

Text Generation

Model card Files Files and versions

Qwen3-30B-A3B-CoderThinking-YOYO-linear / README.md

YOYO-AI's picture

Update README.md

d9aef9b verified about 1 month ago

|

800 Bytes

	---
	license: apache-2.0
	language:
	- en
	- zh
	base_model:
	- Qwen/Qwen3-30B-A3B-Thinking-2507
	- Qwen/Qwen3-Coder-30B-A3B-Instruct
	pipeline_tag: text-generation
	tags:
	- merge
	---

	## Model Highlights:

	- *merge method: `linear`*

	- *Highest precision: `dtype: float32` + `out_dtype: bfloat16`*

	- *Context length: `262,144`*

	## Parameter Settings:
	> [!NOTE]
	> `Temperature=0.6`, `TopP=0.95`, `TopK=20`,`MinP=0`.
	## Configuration:
	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: Qwen/Qwen3-30B-A3B-Thinking-2507
	parameters:
	weight: 0.9
	- model: Qwen/Qwen3-Coder-30B-A3B-Instruct
	parameters:
	weight: 0.1
	merge_method: linear
	tokenizer_source: Qwen/Qwen3-30B-A3B-Thinking-2507
	dtype: float32
	out_dtype: bfloat16
	```