wizardII/ArcherCodeR-1.5B-DAPO

Reinforcement Learning


wizardII committed 7 days ago

Commit 0e73ee4 · verified · 1 Parent(s): c7d5e79

Update README.md

Files changed (1)

README.md +4 -2


@@ -7,7 +7,9 @@ base_model:
 pipeline_tag: reinforcement-learning
 tags:
 - code
-new_version: wizardII/ArcherCodeR-1.5B
+new_version: wizardII/ArcherCodeR-1.5B-DAPO
+language:
+- en
 ---
@@ -81,4 +83,4 @@ Coming soon.
 ## Acknowledgements
 - We build our model upon [`DeepSeek-R1-Distill-Qwen-1.5B`](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B).
-- Training was carried out with a modified version of [verl](https://github.com/volcengine/verl).
+- Training was carried out with a modified version of [verl](https://github.com/volcengine/verl).