Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Simin Chen's picture
3 3 6

Simin Chen

CM
AYouni's profile picture GigaBoy's profile picture 21world's profile picture
·
  • SeekingDream

AI & ML interests

None yet

Recent Activity

updated a collection 3 days ago
DyCodeEval
upvoted a paper 3 days ago
Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
updated a collection 3 days ago
DyCodeEval
View all activity

Organizations

Language Technology Research Group at the University of Helsinki's profile picture Code Kaleidoscope's profile picture

CM 's collections 1

DyCodeEval
DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5.
  • CM/Dynamic_HumanEvalZero

    Viewer • Updated 6 days ago • 15.7k • 9
  • CM/Dynamic_MBPP_sanitized

    Viewer • Updated 6 days ago • 15.8k • 5
  • Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

    Paper • 2503.04149 • Published Mar 6 • 4
DyCodeEval
DyCodeEval (ICML 2025) enables dynamic benchmarking for code LLMs. This collection features dynamic HumanEval and MBPP sets generated with Claude 3.5.
  • CM/Dynamic_HumanEvalZero

    Viewer • Updated 6 days ago • 15.7k • 9
  • CM/Dynamic_MBPP_sanitized

    Viewer • Updated 6 days ago • 15.8k • 5
  • Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

    Paper • 2503.04149 • Published Mar 6 • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs