Running on CPU Upgrade 91 91 Open LLM Leaderboard Model Comparator 🏆 Compare Open LLM Leaderboard results
Running on CPU Upgrade 12.9k 12.9k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Running 60 60 R1-distilled leaderboard ⚡ Generate a leaderboard of open-r1 models based on evaluation scores