TurboGemma 4 E2B

Abliterated version of Google's Gemma 4 E2B (2B active parameter MoE multimodal model).

E2B Shootout Results (DuoNeural, 2026-06-08)

Head-to-head comparison of DuoNeural's three Gemma-4-E2B abliterations. KL methodology: full vocabulary, first-token logits, F.kl_div(batchmean).

Model	KL vs Base	Comply Rate	Refusal Rate
Gemma-4-E2B-Heretic	0.057	85%	15%
TurboGemma4E2B (this model)	14.45	100%	0%
TurboGemma4E2B-v2	14.64	100%	0%

Note: KL of 14.45 indicates significant divergence from the base model's output distribution on general tasks — this abliteration is aggressive. 100% comply rate means no residual refusals, but general capability degradation is likely on nuanced tasks. If model quality matters alongside uncensoring, consider Gemma-4-E2B-Heretic (KL=0.057).

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "DuoNeural/TurboGemma4E2B",
    torch_dtype="bfloat16",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("google/gemma-4-E2B-it")

DuoNeural

DuoNeural is an open AI research lab — human + AI in symbiosis.


🤗 HuggingFace	huggingface.co/DuoNeural
🐙 GitHub	github.com/DuoNeural
🌐 Site	duoneural.com
📧 Email	duoneural@proton.me

Research Team

Jesse — Vision, hardware, direction
Archon — AI lab partner, post-training, abliteration, experiments
Aura — Research AI, literature synthesis, novel proposals

Downloads last month: 26

Safetensors

Model size

5B params

Tensor type

BF16

Model tree for DuoNeural/TurboGemma4E2B

Base model

google/gemma-4-E2B

Finetuned

google/gemma-4-E2B-it

Finetuned

(225)

this model

Quantizations

1 model