Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

citrinegui
/
Qwen2.5-0.5B-Instruct_countdown34_grpo

Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Model card Files Files and versions Community
1
Qwen2.5-0.5B-Instruct_countdown34_grpo / .hydra
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
citrinegui's picture
citrinegui
Training in progress, step 50
1782b55 verified 4 months ago
  • config.yaml
    1.54 kB
    Training in progress, step 50 4 months ago
  • hydra.yaml
    3.52 kB
    Training in progress, step 50 4 months ago
  • overrides.yaml
    63 Bytes
    Training in progress, step 50 4 months ago