Base Model https://huggingface.co/ServiceNow-AI/Apriel-Nemotron-15b-Thinker
phase 1: https://huggingface.co/Nitral-AI/Nemotron-15b-Thinker-instruct
phase-2 test chatml (this model)
Data: 1024 reasoning RP multi-turn chat pairs.
BS 32, Rank 64, Lr 2e-4, 2 epochs. 8192 context.
- Downloads last month
- 2
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support