PRM and fine-tuned LLM used in our PURE github repo: https://github.com/CJReinforce/PURE
Jie Cheng
jinachris
AI & ML interests
Reinforcement learning, LLM
Recent Activity
liked
a model
22 days ago
stepfun-ai/step3-fp8
liked
a model
22 days ago
stepfun-ai/step3
upvoted
a
collection
22 days ago
Step3
Organizations
None yet