-
216
MMLU-Pro Leaderboard
๐ฅMore advanced and challenging multi-task evaluation
-
47
Stick To Your Role! Leaderboard
๐ญBenchmarking LLMs on the stability of simulated populations
-
52
ZeroEval Leaderboard
๐Embed and use ZeroEval for evaluation tasks
-
26
Decentralized Arena Leaderboard
๐ฅDisplay model leaderboard evaluations
Hristo Panev
hppdqdq
AI & ML interests
None yet
Recent Activity
liked
a model
10 days ago
bartowski/TheDrummer_Cydonia-24B-v3.1-GGUF
liked
a model
18 days ago
mradermacher/Broken-Tutu-24B-Unslop-v2.0-i1-GGUF
liked
a model
29 days ago
kyutai/stt-2.6b-en
Organizations
None yet