janhq/250412-llama-3.2-3b-instruct-grpo-02-no-retry
