A newer version of this model is available: ConicCat/Apriel-R1PV.2-NoThink

Quick and dirty roleplay finetune of Apriel, using an improved dataset produced by scoring all replies with a reward model, then discarding every reply that scored below 5/5.
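A rough sketch of that filtering step, in case it helps. This is not the actual script used for this model: `keep_top_scored` and `score_fn` are hypothetical names, and `score_fn` stands in for whatever reward model call returns a 1 to 5 score for a reply.

```python
from typing import Callable, Iterable, List


def keep_top_scored(
    replies: Iterable[str],
    score_fn: Callable[[str], int],  # hypothetical wrapper around a reward model
    threshold: int = 5,              # assumption: scores are integers from 1 to 5
) -> List[str]:
    """Return only the replies whose score meets the threshold."""
    kept = []
    for reply in replies:
        # Anything scoring below the threshold (i.e. below 5/5) is dropped.
        if score_fn(reply) >= threshold:
            kept.append(reply)
    return kept
```

With the default threshold, only replies with a perfect score survive, which trades dataset size for quality.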

Tried to filter for impersonation as well, but Llama 8B was too stupid.

Seems to like a really low temperature (~0.4) and a touch of DRY (0.8).
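For reference, a minimal sketch of those sampler settings as a request payload. It assumes a llama.cpp-style server, where the fields are named `temperature` and `dry_multiplier`, and it assumes the 0.8 above is the DRY multiplier; other backends may name these differently. `build_sampling_params` is a hypothetical helper.

```python
def build_sampling_params(
    prompt: str,
    temperature: float = 0.4,     # low temperature, as suggested above
    dry_multiplier: float = 0.8,  # assumption: 0.8 is the DRY multiplier
) -> dict:
    """Build a JSON-serializable payload with the suggested sampler settings."""
    return {
        "prompt": prompt,
        "temperature": temperature,
        "dry_multiplier": dry_multiplier,
    }
```

The resulting dict could be sent as the JSON body of a completion request; adjust the field names to whatever your backend expects.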

Uses a super funky variant of the Phi template b/c that's what the model seems to like best, even though I tuned it on the Mistral template.

Model tree for ConicCat/Apriel-R1P

Base model: ServiceNow-AI/Apriel-Nemotron-15b-Thinker