Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
eipi1-0
's Collections
Awesome tiny LLMs
LM Preference datas
Remarkable LLMs
Code datas
LM datas
LM SFT datas
LLM RL papers
LLM Leaderboards
LLM Benchmark datas
LLM Benchmark datas
updated
19 days ago
Upvote
-
truthfulqa/truthful_qa
Viewer
•
Updated
Jan 4
•
1.63k
•
28.5k
•
199
tasksource/bigbench
Updated
May 11, 2023
•
8.32k
•
61
THUDM/LongBench
Viewer
•
Updated
Aug 29, 2023
•
8.42k
•
37.3k
•
119
tau/scrolls
Updated
Jan 12
•
2.2k
•
25
gsarti/flores_101
Viewer
•
Updated
Oct 27, 2022
•
207k
•
2.39k
•
20
nguha/legalbench
Updated
Sep 30
•
14.5k
•
82
google/bigbench
Updated
Jan 18
•
871
•
54
GEM/gem
Updated
Jan 18
•
9.16k
•
30
m-a-p/CMMMU
Viewer
•
Updated
Sep 5
•
12k
•
271
•
29
openai/gsm8k
Viewer
•
Updated
Jan 4
•
17.6k
•
197k
•
412
codeparrot/apps
Viewer
•
Updated
Oct 20, 2022
•
20k
•
2.54k
•
123
froggeric/creativity
Viewer
•
Updated
May 28
•
48
•
131
•
81
openai/openai_humaneval
Viewer
•
Updated
Jan 4
•
164
•
144k
•
247
Idavidrein/gpqa
Viewer
•
Updated
Mar 28
•
1.25k
•
15.3k
•
70
google/IFEval
Viewer
•
Updated
Aug 14
•
541
•
5.91k
•
34
TIGER-Lab/MMLU-Pro
Viewer
•
Updated
26 days ago
•
12.1k
•
28.1k
•
282
TAUR-Lab/MuSR
Viewer
•
Updated
May 21
•
756
•
8.77k
•
9
lighteval/MATH-Hard
Viewer
•
Updated
Jun 12
•
7.26k
•
9.52k
•
17
wis-k/instruction-following-eval
Viewer
•
Updated
Dec 5, 2023
•
541
•
12.4k
•
4
SaylorTwift/bbh
Viewer
•
Updated
Jun 16
•
6.76k
•
9.22k
•
3
m-a-p/CodeEditorBench
Preview
•
Updated
Apr 5
•
179
•
19
m-a-p/CHC-Bench
Viewer
•
Updated
Apr 8
•
214
•
244
•
8
openai/MMMLU
Viewer
•
Updated
27 days ago
•
393k
•
1.84k
•
414
lighteval/MATH
Viewer
•
Updated
Oct 17, 2023
•
25k
•
11.4k
•
53
cais/mmlu
Viewer
•
Updated
Mar 8
•
231k
•
66.8k
•
321
hails/agieval-logiqa-zh
Viewer
•
Updated
Jan 26
•
651
•
659
•
3
Upvote
-
Share collection
View history
Collection guide
Browse collections