SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents. Paper • 2504.08703 • Published Apr 11, 2025