merged
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the Passthrough merge method.
PURELY EXPERIMENTAL !!! I DO NOT KNOW IF IT WILL WORK MY INTERNET SPEEDS ARE SO FLIPPIN SLOW I CANT EVEN TEST IT ;-;
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
dtype: bfloat16
merge_method: passthrough
modules:
default:
slices:
- sources:
- layer_range: [0, 9]
model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
- sources:
- layer_range: [5, 14]
model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
- sources:
- layer_range: [10, 19]
model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
- sources:
- layer_range: [15, 24]
model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
- sources:
- layer_range: [20, 32]
model: unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
- Downloads last month
- 5
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for neighbooo/DeepSeek-R1-Distill-Llama-12B-unsloth-bnb-4bit
Base model
deepseek-ai/DeepSeek-R1-Distill-Llama-8B