Fine tuning experiment details at https://github.com/Yeok-c/grpo-gsm8k-demo

Downloads last month
4
Safetensors
Model size
1.78B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support