view article Article Selene 1 Mini: the best small language model-as-a-judge By AtlaAI and 10 others • Jan 29 • 12
view article Article Judge Arena: Benchmarking LLMs as Evaluators By kaikaidai and 7 others • Nov 19, 2024 • 57