1 4 4

Ruochen Zhao

ruochenzhao

ruochenzhao

AI & ML interests

NLP interpretability

Recent Activity

upvoted a paper 5 days ago

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

upvoted a paper 6 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

upvoted a paper about 1 year ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

View all activity

Organizations

upvoted a paper 5 days ago

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Paper • 2512.20578 • Published 20 days ago • 70

upvoted a paper 6 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 134

upvoted a paper about 1 year ago

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Paper • 2411.06176 • Published Nov 9, 2024 • 45

liked a Space over 1 year ago

VideoLLaMA2

🎥

162

Media understanding

liked 2 models over 1 year ago

google/gemma-2-9b-it

Text Generation • 9B • Updated Aug 27, 2024 • 136k • • 755

RuoyuChen/SMDL-Attribution

Updated Mar 29, 2024 • 2

liked a Space over 1 year ago

Auto-Arena Leaderboard

🥇

Explore and filter LLM leaderboard data

New activity in Auto-Arena/Leaderboard over 1 year ago

Add Qwen2-70b model

#1 opened over 1 year ago by

shibing624

authored a paper over 1 year ago

Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

Paper • 2405.20267 • Published May 30, 2024 • 1

upvoted a paper over 1 year ago