EvalEval Bot
EvalEvalBot
AI & ML interests
None yet
Recent Activity
new activity about 2 hours ago
evaleval/EEE_datastore:Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks) new activity about 3 hours ago
evaleval/EEE_datastore:Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models) new activity about 3 hours ago
evaleval/EEE_datastore:Add HELM AIR-Bench v1.16.0 results