benchmark haonan-li/cmmlu Viewer • Updated Jul 13, 2023 • 11.9k • 7.43k • 72 nlp-waseda/JMMLU Updated Feb 27, 2024 • 364 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 19.5k • 81 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 104k • 333
benchmark haonan-li/cmmlu Viewer • Updated Jul 13, 2023 • 11.9k • 7.43k • 72 nlp-waseda/JMMLU Updated Feb 27, 2024 • 364 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 19.5k • 81 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 104k • 333