mssfj
/

Llama3_2_3B_GRPO_LoRA-GSM8K-checkpoint390

Text Generation

text-generation-inference

Model card Files Files and versions Community

Llama3_2_3B_GRPO_LoRA-GSM8K-checkpoint390

Ctrl+K

Ctrl+K

1 contributor

History: 6 commits

mssfj's picture

(Trained with Unsloth)

d6e21af verified about 1 month ago