JustLogic: A Comprehensive Benchmark for Evaluating Deductive Reasoning in Large Language Models
Paper • 2501.14851 • Published Jan 24