Qwen/Qwen3-32B · "/no_think" control is unstable

When passing "/no_think", "/no_thinking" or similar parameters into the system prompt, qwen3 generates no thoughts, as expected.

However!

The next conversation turn DOES include a thinking section.

so a conversation might look like this:

[
    {"role": "system", "content": "/no_think"},
    {"role": "user", "content": "hiii my sweetie qwen!"},
    {"role": "assistant", "content": "<think>\n\n</think>\nhi! how may i assist u today?"},
    {"role": "user", "content": "well, how do you do?"},
    {"role": "assistant", "content": "<think>\nOkay, the user said \"blablabla\". Let's break this down step by step [...]\n</think>\nim doin fine, thanks for askin"}
]

in the example, the first reponse was correctly without thought content, but the latter did have thinking content.

Here an instance of this happening in ollama on qwen3:4b (Q4K)