2 2 1

Kai Chen

hellock

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

Pre-Trained Policy Discriminators are General Reward Models

upvoted a paper 5 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

updated a model 6 months ago

internlm/internlm3-8b-instruct

View all activity

Organizations

upvoted a paper about 13 hours ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published about 24 hours ago • 26

upvoted a paper 5 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

updated a model 6 months ago

internlm/internlm3-8b-instruct

Text Generation • 9B • Updated Feb 11 • 66.5k • 222

New activity in internlm/internlm3-8b-instruct 6 months ago

Context Length

#9 opened 6 months ago by

PSM24

updated a collection 6 months ago

InternLM3

Collection

6 items • Updated Feb 11 • 26

liked a model 6 months ago

internlm/internlm3-8b-instruct

Text Generation • 9B • Updated Feb 11 • 66.5k • 222

updated a collection about 1 year ago

InternLM2-Math

Collection

16 items • Updated Feb 11 • 9

updated a collection over 1 year ago

InternLM2

Collection

7 items • Updated Feb 11 • 9

updated a Space over 1 year ago

README

🐨

Kai Chen

AI & ML interests

Recent Activity

Organizations

hellock's activity

Context Length

README