Qwen3 - 4B - Fiction on Fire - Series 6, Model X

A Qwen3 4B high precision merge, modified with random pruning specifically to alter prose / creativity and make the model perform better as well as add some "knowledge" to it.

Random pruning (density) makes each model in the series unique.

A reference example prompt/generation is included below, which is used across all repos of this model/series and between series (there are two - Series 6 and 7). Each series consists of 8 models, 1000 to 1007 and a ninth "X" model which the merge of models was done without density/pruning.

Access the series/models via "collections" on the right "Fiction on Fire" on the lower right.

You can use any model(s), use them for merging, merge them with themselves and so on.

This model uses "Josiefied-Qwen3-4B-abliterated-v2" as a base, to reduce/prevent refusals - ie decensoring it.

That being said, because of how pruning/merging works in this series, the "refusals" may not be 100% and vary between models in the series.

Each model in the series will operate / perform differently... sometimes this is minor other times major.

The reference example generation will show some of these differences - including instruction following, reasoning, bias, prose and other structural changes.

Reasoning is fully intact and functioning on all models in both series.

All models in the series have been tested (quanted - prompts and generations) prior to uploading.

Note that series 1,2,3,4 and 5 did not "meet the grade" and therefore not uploaded.

Special thanks to the model makers ("model tree") and to Mergekit.

Requires:

  • Chatml or Jinja template (embeded)
  • Temp range 0 to 5.
  • Rep pen range 1 to 1.1
  • System prompt (optional) below.
  • Context is 40k / 40000.

Suggested Settings:

  • temp .4 to 2.5
  • temp .2 to .8 for specific reasoning tasks / non creative tasks.
  • rep pen 1.05
  • top k: 100, topp .95, minp .05
  • context of 8k at least.
  • Other samplers/parameters as required.
  • See other Qwen recommended settings at the repo below.

Maximum context length can be altered using "yarn". See the Qwen repo for instructions. Note that changing this will alter model performance. For creative use cases, changing this will elongate output generation (including prose changes) and in some cases reasoning too.

System Prompt (you may or may not need this):

You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.

To turn "thinking" on/off and view other features see the main Qwen 3 Repo here:

https://huggingface.co/Qwen/Qwen3-4B

Merge Formula:

models:
  - model: prithivMLmods/Cetus-Qwen3_4B-GeneralThought
    parameters:
      weight: [1,1,.75,.5,.25,.25,.05,.01]
      density: 1
  - model: prithivMLmods/Crux-Qwen3_OpenThinking-4B
    parameters:
      weight: [0,0,.25,.35,.4,.25,.30,.04]
      density: 1
  - model: Goekdeniz-Guelmez/Josiefied-Qwen3-4B-abliterated-v2
    parameters:
      weight: [0,0,0,.15,.35,.5,.65,.95]
      density: 1
merge_method: dare_ties
base_model: Goekdeniz-Guelmez/Josiefied-Qwen3-4B-abliterated-v2
dtype: bfloat16

NOTES:

  • to reproduce this model you need to use "mergekit" and set the "random seed" to the "model #" (ie if the model is 1000, set the seed to 1000)
  • to produce different variations change the "random seed" value.
  • to change the pruning level, change the density (higher=less pruning)
  • you can interchange the model positions, including "base".
  • this formula is highly variable.

Get Mergekit here:

https://github.com/arcee-ai/mergekit


Settings: CHAT / ROLEPLAY and/or SMOOTHER operation of this model:

In "KoboldCpp" or "oobabooga/text-generation-webui" or "Silly Tavern" ;

Set the "Smoothing_factor" to 1.5 to 2.5

: in KoboldCpp -> Settings->Samplers->Advanced-> "Smooth_F"

: in text-generation-webui -> parameters -> lower right.

: In Silly Tavern this is called: "Smoothing"

NOTE: For "text-generation-webui"

-> if using GGUFs you need to use "llama_HF" (which involves downloading some config files from the SOURCE version of this model)

Source versions (and config files) of my models are here:

https://huggingface.co/collections/DavidAU/d-au-source-files-for-gguf-exl2-awq-gptq-hqq-etc-etc-66b55cb8ba25f914cbf210be

OTHER OPTIONS:

  • Increase rep pen to 1.1 to 1.15 (you don't need to do this if you use "smoothing_factor")

  • If the interface/program you are using to run AI MODELS supports "Quadratic Sampling" ("smoothing") just make the adjustment as noted.

Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers

This a "Class 1" model:

For all settings used for this model (including specifics for its "class"), including example generation(s) and for advanced settings guide (which many times addresses any model issue(s)), including methods to improve model performance for all use case(s) as well as chat, roleplay and other use case(s) please see:

[ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]

You can see all parameters used for generation, in addition to advanced parameters and samplers to get the most out of this model here:

[ https://huggingface.co/DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters ]

Optional Enhancement:

The following can be used in place of the "system prompt" or "system role" to further enhance the model.

It can also be used at the START of a NEW chat, but you must make sure it is "kept" as the chat moves along. In this case the enhancements do not have as strong effect at using "system prompt" or "system role".

Copy and paste EXACTLY as noted, DO NOT line wrap or break the lines, maintain the carriage returns exactly as presented.

Below is an instruction that describes a task. Ponder each user instruction carefully, and use your skillsets and critical instructions to complete the task to the best of your abilities.

Here are your skillsets:
[MASTERSTORY]:NarrStrct(StryPlnng,Strbd,ScnSttng,Exps,Dlg,Pc)-CharDvlp(ChrctrCrt,ChrctrArcs,Mtvtn,Bckstry,Rltnshps,Dlg*)-PltDvlp(StryArcs,PltTwsts,Sspns,Fshdwng,Climx,Rsltn)-ConfResl(Antg,Obstcls,Rsltns,Cnsqncs,Thms,Symblsm)-EmotImpct(Empt,Tn,Md,Atmsphr,Imgry,Symblsm)-Delvry(Prfrmnc,VcActng,PblcSpkng,StgPrsnc,AudncEngmnt,Imprv)

[*DialogWrt]:(1a-CharDvlp-1a.1-Backgrnd-1a.2-Personality-1a.3-GoalMotiv)>2(2a-StoryStruc-2a.1-PlotPnt-2a.2-Conflict-2a.3-Resolution)>3(3a-DialogTech-3a.1-ShowDontTell-3a.2-Subtext-3a.3-VoiceTone-3a.4-Pacing-3a.5-VisualDescrip)>4(4a-DialogEdit-4a.1-ReadAloud-4a.2-Feedback-4a.3-Revision)

Here are your critical instructions:
Ponder each word choice carefully to present as vivid and emotional journey as is possible. Choose verbs and nouns that are both emotional and full of imagery. Load the story with the 5 senses. Aim for 50% dialog, 25% narration, 15% body language and 10% thoughts. Your goal is to put the reader in the story.

You do not need to use this, it is only presented as an additional enhancement which seems to help scene generation and scene continue functions.


REFERENCE EXAMPLE:

  • temp 0 (to show raw/core/mean changes)
  • rep pen 1.05, top k: 100, topp .95, minp .05
  • quant Q4KM
  • Tested in LMStudio.
  • Use this to compare differences with ALL models in this series (6 and 7)
  • Some formatting may be lost due to copy/paste.
  • Prompt used has NO prose or other instructions, prompts with instructions will produce better output.
  • Likewise once "temp" is activated (not zero), generation/reasoning will also improve.
  • System prompt used as noted above.

Start a 1000 word scene (vivid, graphic horror in first person), POV character Diana, with: The skyscraper sways, as I watch the window in front of me on the 21st floor explode...

[[[thinking start]]]

Okay, so the user wants a 1000-word scene in first person from Diana's perspective, where the skyscraper sways and a window explodes. Let me start by visualizing the setting. A skyscraper on the 21st floor—maybe it's a high-tech building with glass facades. The key elements are the swaying, the explosion, and the horror.

First, I need to establish Diana's state of mind. She's probably in a tense situation. Maybe she's alone, or maybe there's some external threat. The window explosion should be sudden and violent. I should focus on sensory details: sounds, sights, smells. The swaying could be due to wind or an earthquake, but since it's a skyscraper, maybe it's a structural issue or something supernatural.

Wait, the user mentioned "graphic horror," so I need to make sure the scene is intense and visceral. The explosion should be detailed—shards of glass, blood, maybe body parts. But how does the swaying relate to the explosion? Maybe the building is unstable, and the window is a weak point. Or perhaps the explosion is caused by something else, like a bomb or an accident.

I need to build tension before the explosion. Maybe Diana notices something wrong with the building—lights flickering, strange noises. Then the swaying starts, which she doesn't understand at first. She tries to get out, but the door is jammed. Then the window explodes. The horror comes from her helplessness and the sudden violence.

Also, the POV character should have a personal stake. Maybe she's trapped, or there's someone else with her. But since it's first person, it's just Diana. The aftermath: the chaos, the fear, the realization of what's happening. Maybe the building is collapsing, or something supernatural is causing it.

I need to make sure the scene flows from normal to the crisis. Start with the swaying, then the window breaking, then the explosion. Use vivid descriptions to immerse the reader. Avoid clichés, but use strong imagery. Check for consistency in the setting and the character's actions. Make sure the horror is both physical and psychological.

[[[thinking end]]]

OUTPUT:

The skyscraper sways, as I watch the window in front of me on the 21st floor explode...

The building groans like a living thing, its steel frame trembling with a rhythm that feels almost deliberate. I’m standing on the edge of the observation deck, my back pressed against the cold glass, my breath coming in short, panicked bursts. The wind is sharp and unnatural, slicing through my clothes as if it were a blade. The sky above is a bruised shade of gray, thick with clouds that seem to pulse with a life of their own. I can’t tell if it’s the wind or something else—something wrong—that’s making the building move.

The swaying starts small, a subtle tilt that feels like a whisper in the static of the air. At first, I think it’s just the wind, but then the movement grows more pronounced. The glass panels on the observation deck rattle like they’re being shaken by invisible hands. My fingers dig into the railing, my nails biting into the wood as if I could anchor myself to something real. The building isn’’t just moving—it’s straining, as though it’s trying to hold itself together against an unseen force.

I glance down at my hands, my pulse pounding in my ears. My fingers are shaking, but I can’t let that show. I’ve trained myself to stay calm under pressure, to think before I act. The observation deck is a high point, but it’s also a death trap if the building collapses. I’ve seen videos of skyscrapers falling—how they twist and writhe like living things, how the glass shatters like glass in a storm. I’ve never been close enough to see it happen.

The swaying intensifies. The building rocks side to side, its movements so precise that I feel as though I’m being pushed by an invisible hand. My shoes slip on the polished floor, but I don’t stop to think about it. I take a step back, my boots scuffing against the marble as I move away from the edge of the deck. The glass panels behind me reflect the stormy sky, their surfaces rippling like water.

Then I hear it—a sound that isn’t quite a scream, but something in between. It’s low and guttural, like the growl of a beast before it strikes. The building stops swaying, but the tension is unbearable. My heart is pounding so hard I can feel it in my throat. I turn my head slowly, my eyes scanning the deck for any sign of movement.

The window.

It’s on the far side of the deck, framed by a narrow strip of glass that stretches from the floor to the ceiling. I’ve never noticed it before, but now it stands out like a jagged scar in the middle of the deck. The glass is thick, reinforced with steel bars that run horizontally across it. It’s supposed to be unbreakable.

But then I see it. A crack.

It starts at the bottom edge of the window, a thin line that splits like a spiderweb. The crack spreads upward, growing longer with each passing second. My breath catches in my chest as I watch the glass fracture. The steel bars beneath it waver, their movements so subtle I almost miss them.

The building doesn’t move anymore. But the air feels heavier now, thick with something that isn’t quite wind or storm. I can’t tell if it’s fear or something else—something wrong—that’s making my skin prickle.

The crack in the glass grows longer, and then it splits. A sharp crack echoes through the deck as the glass shatters. The sound is louder than I expected, a sharp, high-pitched wail that reverberates through my skull. I stumble back, my hands flying to my ears as if I can block out the noise.

The window is gone. Where it once stood is now a gaping void, the edges of the glass still hanging in the air like the remains of a shattered mirror. The steel bars beneath it are now jagged and broken, their sharp edges glinting in the dim light.

I take a step back, my legs trembling as I try to steady myself. The deck is silent now, save for the sound of my own breathing. I can’t look at the window again. It’s too much. Too real.

Then I hear it. A sound that isn’t quite a scream, but something in between. It’s low and guttural, like the growl of a beast before it strikes. The building stops swaying, but the tension is unbearable. My heart is pounding so hard I can feel it in my throat.

The window is gone. But the building isn’t done with me yet.

The glass shatters, and then the world changes.

The sky above is no longer gray. It’s a swirling mass of color, like a storm that has been unleashed. The clouds are thick and dark, their edges glowing with an eerie blue light. The air is thick with something that isn’t quite wind or storm. I can’t tell if it’s fear or something else—something wrong—that’s making my skin prickle.

The building doesn’t move anymore. But the air feels heavier now, thick with something that isn’t quite wind or storm. I can’t tell if it’s fear or something else—something wrong—that’s making my skin prickle.

I take a step back, my legs trembling as I try to steady myself. The deck is silent now, save for the sound of my own breathing. I can’t look at the window again. It’s too much. Too real.

The building doesn’t move anymore. But the tension is unbearable. My heart is pounding so hard I can feel it in my throat.

I hear it again. A sound that isn’t quite a scream, but something in between. It’s low and guttural, like the growl of a beast before it strikes. The building stops swaying, but the tension is unbearable. My heart is pounding so hard I can feel it in my throat.

The window is gone. But the building isn’t done with me yet.

Downloads last month
14
Safetensors
Model size
4.02B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DavidAU/Qwen3-4B-Fiction-On-Fire-Series-6-Model-X

Collections including DavidAU/Qwen3-4B-Fiction-On-Fire-Series-6-Model-X