zijianh/DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-new

Text Generation

Generated from Trainer

text-generation-inference

DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-new / generation_config.json

Commit History

Model save

368616f
verified

zijianh committed on Mar 21