DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5.
Simin Chen
CM
AI & ML interests
None yet
Recent Activity
View all activity