Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Evaluation datasets
community
Activity Feed
Follow
58
AI & ML interests
None defined yet.
Recent Activity
lewtun
updated
a model
8 days ago
lighteval/different-chat-templates-per-revision
lewtun
authored
a paper
11 days ago
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning
thomwolf
authored
a paper
about 2 months ago
SmolVLM: Redefining small and efficient multimodal models
View all activity
Team members
8
models
1
lighteval/different-chat-templates-per-revision
Updated
8 days ago
datasets
75
Sort: Recently updated
lighteval/okapi_mmlu
Viewer
•
Updated
Mar 24
•
443k
•
154
•
1
lighteval/okapi_arc_challenge
Viewer
•
Updated
Mar 24
•
79.6k
•
305
•
1
lighteval/small_natural_questions
Viewer
•
Updated
Jan 29
•
1.71k
•
76
lighteval/SimpleQA
Viewer
•
Updated
Jan 28
•
4.33k
•
143
•
2
lighteval/MWP-TR
Viewer
•
Updated
Jan 10
•
4.16k
•
18
lighteval/MathQA-TR
Viewer
•
Updated
Jan 10
•
19.6k
•
29
lighteval/QazUNTv2
Viewer
•
Updated
Nov 26, 2024
•
1.7k
•
27
lighteval/HAWP
Viewer
•
Updated
Nov 19, 2024
•
2.34k
•
18
•
1
lighteval/elkarhizketak
Viewer
•
Updated
Oct 8, 2024
•
1.63k
•
24
lighteval/hellaswag_thai
Viewer
•
Updated
Sep 25, 2024
•
25.6k
•
33
Expand 75 datasets