Shuaiyi Nie
ShuaiyiNie
AI & ML interests
None yet
Recent Activity
upvoted a paper 6 days ago
S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models
authored a paper 7 days ago
S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models
liked a dataset 2 months ago
lmsys/mt_bench_human_judgments
Organizations
None yet
Papers
1
arxiv:2504.10368
models
None public yet
datasets
None public yet