Rui Yang

Ray2333

AI & ML interests

Deep Reinforcement Learning

Organizations

DynaMath Team, RandomSampling, MergeBench, MergeBench-Llama-3B, EmbodiedBench, MergeBench-gemma-2-9b

Ray2333's activity

New activity in microsoft/Magma-8B about 2 months ago

generation_args in the example

#10 opened about 2 months ago by Ray2333
New activity in EmbodiedBench/EB-Manipulation about 2 months ago

Add dataset card

#1 opened about 2 months ago by nielsr
New activity in Ray2333/Gemma-2B-rewardmodel-baseline about 2 months ago
New activity in Ray2333/GRM-Llama3.2-3B-rewardmodel-ft 5 months ago

Model Size

#1 opened 5 months ago by szhang120
New activity in Ray2333/gpt2-large-harmless-reward_model 9 months ago

a bug when loading model

#2 opened 9 months ago by ssmmzz
New activity in Ray2333/gpt2-large-harmless-reward_model about 1 year ago

How to train the model

#1 opened about 1 year ago by mike2000