Finding Blind Spots in Evaluator LLMs with Interpretable Checklists Paper • 2406.13439 • Published Jun 19