GAIA release - a gaia-benchmark Collection

gaia-benchmark 's Collections

GAIA release

updated Nov 23, 2023

Gather the items of the GAIA release

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 222

Note The arxiv paper (arxiv.org/abs/2311.12983) describing the benchmark and dataset creation methodology.
Running on CPU Upgrade

469

469

GAIA Leaderboard

🦾

Submit and evaluate models on GAIA benchmark

Note The leaderboard itself with the scored models and information on how to submit a new model.
gaia-benchmark/GAIA

Updated Feb 13 • 9.47k • 380

Note The dataset with questions for the GAIA benchmark.
gaia-benchmark/results_public

Viewer • Updated about 7 hours ago • 599 • 3.48k • 16

Note Open dataset of submission results.