zijianh
/

DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-high-0_1-new

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-high-0_1-new / all_results.json

Commit History

Model save

d573650
verified

zijianh commited on about 1 month ago