merged

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Passthrough merge method.

PURELY EXPERIMENTAL! I do not know if it will work. My internet is so flippin' slow that I can't even test it ;-;

Models Merged

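The following model was included in the merge (it is the only source named in the configuration):

unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit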

Configuration

The following YAML configuration was used to produce this model:

dtype: bfloat16
merge_method: passthrough
modules:
  default:
    slices:
    - sources:
      - layer_range: [0, 9]
        model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
    - sources:
      - layer_range: [5, 14]
        model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
    - sources:
      - layer_range: [10, 19]
        model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
    - sources:
      - layer_range: [15, 24]
        model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
    - sources:
      - layer_range: [20, 32]
        model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
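
To make the effect of the overlapping slices easier to see, here is a minimal sketch that expands the layer ranges from the configuration into the resulting layer stack. It is not part of mergekit. The helper names `stacked_layers` and `duplicated_layers` are hypothetical, and the sketch assumes each `layer_range` is half-open, i.e. `[start, end)`.

```python
from collections import Counter

# Layer ranges copied from the configuration above.
# Assumption: each range is half-open, [start, end).
SLICES = [(0, 9), (5, 14), (10, 19), (15, 24), (20, 32)]


def stacked_layers(slices):
    """Return the source-layer index used at each position of the merged stack."""
    stack = []
    for start, end in slices:
        # A passthrough merge copies layers unchanged, so stacking is just
        # concatenating the selected ranges in order.
        stack.extend(range(start, end))
    return stack


def duplicated_layers(stack):
    """Return the sorted source layers that appear more than once in the stack."""
    counts = Counter(stack)
    return sorted(layer for layer, n in counts.items() if n > 1)


stack = stacked_layers(SLICES)
print("merged depth:", len(stack))
print("distinct source layers:", len(set(stack)))
print("repeated source layers:", duplicated_layers(stack))
```

Under that assumption the merged stack has 48 layers drawn from 32 distinct source layers, so 16 source layers appear twice and the result is about 1.5 times as deep as the source model.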
Safetensors
Model size: 6.37B params
Tensor type: BF16

Model tree for neighbooo/DeepSeek-R1-Distill-Llama-12B-unsloth-bnb-4bit