Wei Xiong

weqweasdas

AI & ML interests

Machine learning, RLHF

Recent Activity

updated a dataset 6 days ago
weqweasdas/remain2
published a dataset 6 days ago
weqweasdas/remain2
updated a dataset 6 days ago
weqweasdas/remain

Organizations

reward modeling, raft_study, Directional Preference Alignment, RLHFlow, RRLHF, TIRData, feedbackagent, myselfrew, selfcorrexp, selfcorrexp2, mytestdpo, tmpmodelsave, qwselfcorr, dsrtrain, dsrselfcorr, ptllama, raftstudy

weqweasdas's activity

New activity in RLHFlow/LLaMA3-SFT 8 months ago

LLaMA3.1-SFT

3
#3 opened 8 months ago by jackzhang
New activity in RLHFlow/LLaMA3-SFT 10 months ago
New activity in RLHFlow/ArmoRM-Llama3-8B-v0.1 10 months ago
New activity in sfairXC/FsfairX-LLaMA3-RM-v0.1 11 months ago
New activity in weqweasdas/RM-Mistral-7B about 1 year ago

why vocab size is 32001

1
#3 opened about 1 year ago by yechenzhi1

License

1
#2 opened about 1 year ago by ravir123

Fix dataset link

#1 opened about 1 year ago by ZennyKenny