JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Paper • 2310.17631
Curated resources on using LLMs as automatic evaluators of other LLMs' outputs.
Vote on AI responses to rank models