GAIA Leaderboard
🦾
Submit and evaluate models on a leaderboard
Compare model answers to questions
Track, rank and evaluate open LLMs and chatbots
Explore and filter language model benchmark results
Run a Streamlit web app
Evaluate language models automatically
Display and explore model leaderboards and chat history