Openrouter Reasoning? (+ Questions about prompting)

#12
by cinnamoo0 - opened

Hi hi. I've been trying out the new model through Openrouter. I assume they still disable thinking by default, but I was wondering if there's a prompt to enable it? I use JanitorAI for reference.

I was also wondering if the custom prompt I use works well for R1T2, or if I should look for another, https://rentry.co/molekprompt#೯-𖥻-moleks-base-prompt-version-07-ᰋ

Currently, I still struggle with feelings that the LLM just isn't... reading my prompts? It has been calling my pink-haired persona's hair silver. Just wondering if there was a general ''fix'' for any of this.

TNG Technology Consulting GmbH org
edited Jul 8

Greetings,
thanks for your questions.

A) On OpenRouter, reasoning is enabled for R1T2, as you can see by looking at the graph at:
https://openrouter.ai/tngtech/deepseek-r1t2-chimera:free/activity
For example, it is now about 6 hours after the model became live on OR, and it has 144M input tokens, 7.31M reasoning tokens and 5.48M completion tokens.

B) Regarding custom RP-prompts: We have no experience in that area. If the original R1T Chimera was working for you in that respect, maybe it is worth sticking with R1T? Or try some slight prompt variations?

C) In case you are using the OpenRouter chat, it has a generic bug when used with reasoning models such as R1T2, R1-0528, Microsoft R1 or Qwen3 235B A22B: If you run a long reasoning query and stop/interrupt it while reasoning, and then ask a next question, the previous question will be restarted, not the next question answered. That can create the true impression of the reasoning LLM not reading the last prompt. But that is not the LLM's fault. Also, this should not appear when using a different chat client, of course.

D) We did design / optimize R1T2 to be good in topics like mathematics and coding, big thanks to the DeepSeek parent models. But we also tried to create R1T2 to have a creative, very funny personality. At least from a nerd's perspective, its programming and mathematical jokes can be hilarious. This natural overflowing creativity of the model may interfere with RP behaviour, but at this moment I would not know how to quantify this.

I hope this helps.

Thank you for the response! <3
I know this is a bit far-reached, but will TNG ever make a model specifically for roleplays?

TNG Technology Consulting GmbH org

Hello,
I guess that is unlikely at the moment. Almost all of us are software developers, which makes coder models and general purpose, business capable models most interesting for us.
Cheers!

TNGHK changed discussion status to closed

Sign up or log in to comment