Uploaded model
- Developed by: derek33125
- License: apache-2.0
- Finetuned from model : derek33125/PA-stage1-300
This qwen2 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 4
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support
Model tree for derek33125/PA-stage2-500
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Finetuned
unsloth/DeepSeek-R1-Distill-Qwen-7B
Finetuned
derek33125/PA-stage1-300