DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation Paper • 2503.01622 • Published Mar 3
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias Paper • 2308.00225 • Published Aug 1, 2023