RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper • 2504.17502 • Published 2 days ago • 49
JuStRank: Benchmarking LLM Judges for System Ranking Paper • 2412.09569 • Published Dec 12, 2024 • 20
ModuleFormer: Learning Modular Large Language Models From Uncurated Data Paper • 2306.04640 • Published Jun 7, 2023 • 7