sroecker
/

Qwen-1.B-GRPO-gsm8k-1000

text-generation-inference

Model card Files Files and versions Community

Qwen-1.B-GRPO-gsm8k-1000 / vocab.json

sroecker's picture

Trained with Unsloth

40b1c5d verified 3 months ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.