"/no_think" control is unstable

#16
by Smorty100 - opened

When passing "/no_think", "/no_thinking" or a similar switch in the system prompt, Qwen3 generates no thoughts, as expected.

However!

The next conversation turn DOES include a thinking section.

So a conversation might look like this:

[
    {"role": "system", "content": "/no_think"},
    {"role": "user", "content": "hiii my sweetie qwen!"},
    {"role": "assistant", "content": "<think>\n\n</think>\nhi! how may i assist u today?"},
    {"role": "user", "content": "well, how do you do?"},
    {"role": "assistant", "content": "<think>\nOkay, the user said \"blablabla\". Let's break this down step by step [...]\n</think>\nim doin fine, thanks for askin"}
]

In the example, the first response correctly had no thought content, but the second one did.
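
To check a longer chat log for this, something like the following could flag which assistant turns still contain reasoning. This is only a rough sketch: `turns_with_thoughts` is a made-up helper name, and it assumes the thoughts are always wrapped in `<think>...</think>` tags as in the example above.

```python
import re

# Matches the content between <think> and </think>, across newlines.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def turns_with_thoughts(messages):
    """Return the indices of assistant messages whose <think> block is non-empty."""
    flagged = []
    for i, msg in enumerate(messages):
        if msg["role"] != "assistant":
            continue
        match = THINK_RE.search(msg["content"])
        # An empty block ("<think>\n\n</think>") means thinking was skipped.
        if match and match.group(1).strip():
            flagged.append(i)
    return flagged

conversation = [
    {"role": "system", "content": "/no_think"},
    {"role": "user", "content": "hiii my sweetie qwen!"},
    {"role": "assistant", "content": "<think>\n\n</think>\nhi! how may i assist u today?"},
    {"role": "user", "content": "well, how do you do?"},
    {"role": "assistant", "content": "<think>\nOkay, the user said \"blablabla\". [...]\n</think>\nim doin fine, thanks for askin"},
]

print(turns_with_thoughts(conversation))  # [4]
```

Only the second assistant message (index 4) is flagged, which matches what I'm seeing.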

Here is an instance of this happening in ollama on qwen3:4b (Q4K):

image.png

I was dealing with stubborn thinking in a roleplay scenario: it stayed enabled even though I had specifically set it to disabled, and I just wanted to see the direct response and nothing else. What I ended up doing was prepending <think>\n\n</think>\n to the AI's response, and it started giving me direct answers. It's not ideal, but it works.
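
In case it helps, here is roughly what that prefill looks like when building the prompt by hand. Treat it as a sketch under assumptions: `build_prompt` is a hypothetical helper, it assumes a ChatML-style layout (`<|im_start|>` / `<|im_end|>`), and it assumes the backend accepts a raw text prompt so the model continues right after the empty think block.

```python
# The empty thinking block to prepend to the assistant's reply.
EMPTY_THINK = "<think>\n\n</think>\n"

def build_prompt(messages, prefill=EMPTY_THINK):
    """Render messages in a ChatML-style layout (assumed) and open the
    assistant turn with an already-closed, empty <think> block."""
    parts = []
    for msg in messages:
        parts.append(f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>\n")
    # Leave the assistant turn open so generation continues after the prefill.
    parts.append("<|im_start|>assistant\n" + prefill)
    return "".join(parts)

history = [
    {"role": "system", "content": "/no_think"},
    {"role": "user", "content": "well, how do you do?"},
]

prompt = build_prompt(history)
print(prompt)
```

The resulting string would be sent to a plain completion endpoint rather than a chat endpoint, since a chat endpoint applies its own template and may not let the assistant turn be prefilled.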
