zijianh
/

DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-high-0_5-new

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-high-0_5-new

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

zijianh's picture

initial commit

f2b7775 verified about 1 month ago

.gitattributes

1.52 kB

initial commit about 1 month ago