benchmark haonan-li/cmmlu Viewer • Updated Jul 13, 2023 • 11.9k • 9.98k • 72 nlp-waseda/JMMLU Updated Feb 27, 2024 • 499 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 21k • 77 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 85.7k • 327
benchmark haonan-li/cmmlu Viewer • Updated Jul 13, 2023 • 11.9k • 9.98k • 72 nlp-waseda/JMMLU Updated Feb 27, 2024 • 499 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 21k • 77 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 85.7k • 327