A newer version of this model is available:
ConicCat/Apriel-R1PV.2-NoThink
Quick and dirty roleplayfinetune of Apriel, using an improved dataset produced by scoring all replies with a Reward model, then discarding scores <5/5.
Tried to filter for impersonation as well, but Llama 8B was too stupid.
Seems to like really low temp ~.4 and a touch of DRY .8.
Uses a super funky variant of the Phi template b/c that's what the model seems to like best even though I tuned it on mistral.
- Downloads last month
- 2
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support