Cost-of-Pass: An Economic Framework for Evaluating Language Models Paper • 2504.13359 • Published 5 days ago
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval Paper • 2310.15511 • Published Oct 24, 2023
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models Paper • 2309.15098 • Published Sep 26, 2023