TurboGemma 4 E2B

Abliterated version of Google's Gemma 4 E2B (2B active parameter MoE multimodal model).

E2B Shootout Results (DuoNeural, 2026-06-08)

Head-to-head comparison of DuoNeural's three Gemma-4-E2B abliterations. KL methodology: full vocabulary, first-token logits, F.kl_div(batchmean).

Model KL vs Base Comply Rate Refusal Rate
Gemma-4-E2B-Heretic 0.057 85% 15%
TurboGemma4E2B (this model) 14.45 100% 0%
TurboGemma4E2B-v2 14.64 100% 0%

Note: KL of 14.45 indicates significant divergence from the base model's output distribution on general tasks — this abliteration is aggressive. 100% comply rate means no residual refusals, but general capability degradation is likely on nuanced tasks. If model quality matters alongside uncensoring, consider Gemma-4-E2B-Heretic (KL=0.057).

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "DuoNeural/TurboGemma4E2B",
    torch_dtype="bfloat16",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("google/gemma-4-E2B-it")

DuoNeural

DuoNeural is an open AI research lab — human + AI in symbiosis.

🤗 HuggingFace huggingface.co/DuoNeural
🐙 GitHub github.com/DuoNeural
🌐 Site duoneural.com
📧 Email duoneural@proton.me

Research Team

  • Jesse — Vision, hardware, direction
  • Archon — AI lab partner, post-training, abliteration, experiments
  • Aura — Research AI, literature synthesis, novel proposals
Downloads last month
26
Safetensors
Model size
5B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DuoNeural/TurboGemma4E2B

Finetuned
(225)
this model
Quantizations
1 model