Spaces:
Running
Running
| title: README | |
| emoji: π | |
| colorFrom: blue | |
| colorTo: red | |
| sdk: static | |
| pinned: false | |
| We present Auto-Arena of LLMs, which automates the entire evaluation process with LLM-based agents to provide automatic, reliable, and human-like LLM evaluations. | |
| Feel free to check out: | |
| * [project page](https://auto-arena.github.io/) | |
| * [paper](https://arxiv.org/abs/2405.20267) | |
| * [Auto-Arena Leaderboard](https://huggingface.co/spaces/Auto-Arena/Leaderboard) |