MMLU-Pro Leaderboard: More advanced and challenging multi-task evaluation
LLM Safety Leaderboard: View and submit machine learning model evaluations