Yunxiang Zhang

yunx-z

https://yunx-z.github.io/

AI & ML interests

None yet

yunx-z's activity

updated a dataset 9 days ago

yunx-z/MLRC-Bench

Viewer • Updated 9 days ago • 7 • 33

published a dataset 9 days ago

yunx-z/MLRC-Bench

Viewer • Updated 9 days ago • 7 • 33

upvoted an article 24 days ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 291

upvoted a paper 29 days ago

Process Reward Models That Think

Paper • 2504.16828 • Published Apr 23 • 16

upvoted 2 papers about 1 month ago

CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives

Paper • 2504.10823 • Published Apr 15 • 14

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

Paper • 2504.09702 • Published Apr 13 • 18

authored a paper about 1 month ago

BiRdQA: A Bilingual Dataset for Question Answering on Tricky Riddles

Paper • 2109.11087 • Published Sep 23, 2021

commented on a paper about 1 month ago

MLRC-Bench: Can Language Agents Solve Machine Learning Research Challenges?

Paper • 2504.09702 • Published Apr 13 • 18

updated a Space about 1 month ago

MLRC-BENCH

📊

Display model performance metrics

published a Space about 1 month ago

MLRC-BENCH

📊

Display model performance metrics

liked a Space 7 months ago

CoI Agent

🐢

Online demo of paper: Chain of Ideas: Revolutionizing Resear