---
license: mit
tags:
- unsloth
- gsm8k
---
Fine-tuning experiment details are available at https://github.com/Yeok-c/grpo-gsm8k-demo