mkurman/llama-3.2-MEDIT-3B-o1-GRPO-LLM-Eval

Text Generation

GRPO-LLM-Evaluator

text-generation-inference

llama-3.2-MEDIT-3B-o1-GRPO-LLM-Eval / model-00002-of-00002.safetensors

Commit History

Upload 8 files

e363456
verified

mkurman committed on Feb 22