L3.3-Shakudo-70b - EXL3 3.0bpw H6
This is a 3bpw EXL3 quant of Steelskull/L3.3-Shakudo-70b.
This quant was made with exllamav3 0.0.5 using '--cal_cols 4096' (instead of the default 2048), which in my experience improves quant quality slightly.
At 3bpw the model fits in 32 GB of VRAM on Windows with around 18-20k of Q8 context.
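As a rough sanity check on that figure, the sketch below estimates the Q8 KV-cache footprint at about 20k tokens. The layer count, KV-head count, and head dimension are assumed standard Llama 3 70B geometry, not values taken from this card.

```python
# Back-of-the-envelope KV-cache estimate (assumed Llama 3 70B geometry:
# 80 layers, 8 KV heads via GQA, head dim 128; Q8 cache ~ 1 byte per element).
layers, kv_heads, head_dim = 80, 8, 128
bytes_per_elem = 1            # Q8 cache
tokens = 20_000

# K and V caches for every layer, per token, times the context length.
kv_bytes = 2 * layers * kv_heads * head_dim * bytes_per_elem * tokens
print(f"KV cache at {tokens} tokens: {kv_bytes / 2**30:.1f} GiB")   # ~3.1 GiB
```

Added to the roughly 24-25 GiB of ~3bpw weights, this lands a little under the 32 GB budget, which is consistent with the 18-20k figure above.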
I tested this quant briefly in a few random RPs (including ones over 8k and 16k context) and it seems to work fine.
Prompt Templates
Uses the Llama 3 Instruct format. Supports thinking when the assistant response is prefilled with "<thinking>".
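As a minimal illustration (the helper function and system prompt below are examples, not something shipped with the model), a single-turn prompt in this format with the thinking prefill can be assembled like so:

```python
# Llama 3 Instruct prompt layout with a "<thinking>" prefill on the assistant turn.
def build_prompt(system: str, user: str, prefill: str = "<thinking>") -> str:
    return (
        "<|begin_of_text|>"
        f"<|start_header_id|>system<|end_header_id|>\n\n{system}<|eot_id|>"
        f"<|start_header_id|>user<|end_header_id|>\n\n{user}<|eot_id|>"
        f"<|start_header_id|>assistant<|end_header_id|>\n\n{prefill}"
    )

print(build_prompt("You are a narrator for an interactive story.",
                   "Describe the harbor district at dawn."))
```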
Original readme below
L3.3-Shakudo-70b

⚡ Top Sponsors
🏆 Top Supporters
If I forgot you, please let me know; ko-fi doesn't let me track supporters easily.
🤝 Valued Partners
Model Information
L3.3-Shakudo-70b
Model Composition
- Final Merge: L3.3-Shakudo-70b
- Model 1: L3.3-M1-Hydrargyrum-70B
- Model 2: L3.3-M2-Hydrargyrum-70B
Model Creation Process
L3.3-Shakudo-70b is the result of a multi-stage merging process by Steelskull, designed to create a powerful and creative roleplaying model with a unique flavor. The creation process involved several advanced merging techniques, including weight twisting, to achieve its distinct characteristics.
Stage 1: The Cognitive Foundation & Weight Twisting
The process began by creating a cognitive and tool-use focused base model, L3.3-Cogmoblated-70B. This was achieved through a `model_stock` merge of several models known for their reasoning and instruction-following capabilities. This base was built upon `nbeerbower/Llama-3.1-Nemotron-lorablated-70B`, a model intentionally "ablated" to skew refusal behaviors. This technique, known as weight twisting, helps the final model adopt more desirable response patterns by building upon a foundation that is already aligned against common refusal patterns.
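The exact recipe is not reproduced here, but a `model_stock` merge of this kind is commonly written as a mergekit configuration roughly like the sketch below; the donor entries are placeholders, and only the lorablated base model name comes from the text.

```python
# Illustrative mergekit-style config (expressed as a Python dict) for a
# model_stock merge over the lorablated base. Donor models are placeholders,
# not the author's actual list.
cogmoblated_sketch = {
    "merge_method": "model_stock",
    "base_model": "nbeerbower/Llama-3.1-Nemotron-lorablated-70B",
    "models": [
        {"model": "example/reasoning-donor-70B"},   # placeholder
        {"model": "example/instruct-donor-70B"},    # placeholder
    ],
    "dtype": "bfloat16",                            # assumed
}
```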
Stage 2: The Twin Hydrargyrum - Flavor and Depth
Two distinct models were then created from the Cogmoblated base:
- L3.3-M1-Hydrargyrum-70B: This model was merged using `SCE`, a technique that enhances creative writing and prose style, giving the model its unique "flavor." The Top_K for this merge was set to 0.22.
- L3.3-M2-Hydrargyrum-70B: This model was created using a `Della_Linear` merge, which focuses on integrating the "depth" of various roleplaying and narrative models. The settings for this merge were: lambda 1.1, weight 0.2, density 0.7, epsilon 0.2 (both merges are sketched as configs after this list).
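Sketched in the same mergekit-config style, the two Stage 2 merges would look roughly as follows. Only the parameter values quoted above are taken from the card; the base-model path and donor lists are assumptions.

```python
# Hypothetical configs for the two Hydrargyrum merges (placeholder donors).
m1_sce_sketch = {
    "merge_method": "sce",
    "base_model": "Steelskull/L3.3-Cogmoblated-70B",         # assumed repo path
    "models": [{"model": "example/prose-donor-70B"}],         # placeholder
    "parameters": {"select_topk": 0.22},                       # the "Top_K" above
    "dtype": "bfloat16",
}

m2_della_linear_sketch = {
    "merge_method": "della_linear",
    "base_model": "Steelskull/L3.3-Cogmoblated-70B",          # assumed repo path
    "models": [
        {
            "model": "example/roleplay-donor-70B",             # placeholder
            "parameters": {"weight": 0.2, "density": 0.7, "epsilon": 0.2},
        },
    ],
    "parameters": {"lambda": 1.1},
    "dtype": "bfloat16",
}
```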
Final Stage: Shakudo
The final model, L3.3-Shakudo-70b, was created by merging the two Hydrargyrum variants using a 50/50 `nuslerp`. This final step combines the rich, creative prose (flavor) from the SCE merge with the strong roleplaying capabilities (depth) from the Della_Linear merge, resulting in a model with a distinct and refined narrative voice.
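A minimal sketch of how such a 50/50 `nuslerp` step can be run through mergekit's Python API is shown below. This is not the author's actual script; the repo paths, dtype, output directory, and options are assumptions.

```python
# Sketch of a 50/50 nuslerp merge of the two Hydrargyrum variants via mergekit.
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

config = MergeConfiguration.model_validate({
    "merge_method": "nuslerp",
    "models": [
        {"model": "Steelskull/L3.3-M1-Hydrargyrum-70B", "parameters": {"weight": 0.5}},
        {"model": "Steelskull/L3.3-M2-Hydrargyrum-70B", "parameters": {"weight": 0.5}},
    ],
    "dtype": "bfloat16",                 # assumed
})

run_merge(
    config,
    "./L3.3-Shakudo-70b",                # output directory (assumed)
    options=MergeOptions(cuda=True, copy_tokenizer=True, lazy_unpickle=True),
)
```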
A special thank you to Nectar.ai for their generous support of the open-source community and my projects.
Additionally, a heartfelt thanks to all the Ko-fi supporters who have contributed; your generosity is deeply appreciated and helps keep this work going and the Pods spinning.
Recommended Sampler Settings
Good Starting Templates & Prompts
Support & Community: