Can Large Language Models Capture Human Annotator Disagreements? Paper โข 2506.19467 โข Published 2 days ago โข 15
Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning Paper โข 2502.11962 โข Published Feb 17 โข 34
Open LLM Leaderboard best models โค๏ธโ๐ฅ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: โข 65 items โข Updated Mar 20 โข 608