YOYO-O1-32B / README.md
YOYO-AI's picture
Update README.md
7bfe7f5 verified
|
raw
history blame
1.49 kB
metadata
base_model:
  - qihoo360/Light-R1-32B-DS
  - Qwen/QwQ-32B
  - Qwen/Qwen2.5-Coder-32B-Instruct
  - Qwen/Qwen2.5-Coder-32B
  - qihoo360/Light-R1-32B
library_name: transformers
tags:
  - mergekit
  - merge

image/jpeg

YOYO-O1-QwQ-Light-DS-Coder-32B

Combined the most top-notch 32B inference model and code model in the entire open-source community.

Merge Method

This model was merged using the SCE merge method using Qwen/Qwen2.5-Coder-32B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: sce
models:
  # Pivot model
  - model: Qwen/Qwen2.5-Coder-32B
  # Target models
  - model: qihoo360/Light-R1-32B-DS
  - model: qihoo360/Light-R1-32B
  - model: Qwen/QwQ-32B
  - model: Qwen/Qwen2.5-Coder-32B-Instruct
base_model: Qwen/Qwen2.5-Coder-32B
parameters:
  select_topk: 1
dtype: bfloat16
tokenizer_source: Qwen/QwQ-32B
normalize: true
int8_mask: true