Weyaxi's picture
Upload folder using huggingface_hub
564c025
|
raw
history blame
693 Bytes
metadata
license: apache-2.0

SauerkrautLM-UNA-SOLAR-Instruct

This is the model for SauerkrautLM-UNA-SOLAR-Instruct. I used mergekit to merge models.

Yaml Config


slices:
  - sources:
      - model: VAGOsolutions/SauerkrautLM-SOLAR-Instruct
        layer_range: [0, 48]
      - model: fblgit/UNA-SOLAR-10.7B-Instruct-v1.0
        layer_range: [0, 48]

merge_method: slerp
base_model: upstage/SOLAR-10.7B-Instruct-v1.0

parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5 # fallback for rest of tensors
tokenizer_source: union

dtype: bfloat16