base_model: unsloth/qwen2.5-1.5b-instruct-unsloth-bnb-4bit | |
library_name: peft | |
# Model Card for Model ID | |
<!-- Provide a quick summary of what the model is/does. --> | |
I believe the run is associated with: https://wandb.ai/CoRL-heist-2025/decision_grpo/runs/6kciyrxy/overview |