Code contamination in HumanEval and MBPP (#12)
Browse files- Update contamination_report.csv (cbab33e322ea23a1d1a9ca8a694e2740c09ac2b4)
- Update contamination_report.csv (04c55d99499257e0d0e3ca2c7c97bb1e1060c973)
Co-authored-by: Ameya Prabhu <[email protected]>
- contamination_report.csv +6 -1
contamination_report.csv
CHANGED
|
@@ -462,4 +462,9 @@ bigbio/mednli;;GPT-3.5;model;0.0;0.0;0.0;model-based;https://arxiv.org/pdf/2308.
|
|
| 462 |
|
| 463 |
RadNLI;;GPT-4;model;0.0;0.0;0.0;model-based;https://arxiv.org/pdf/2308.08493;8
|
| 464 |
RadNLI;;GPT-3.5;model;0.0;0.0;0.0;model-based;https://arxiv.org/pdf/2308.08493;8
|
| 465 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 462 |
|
| 463 |
RadNLI;;GPT-4;model;0.0;0.0;0.0;model-based;https://arxiv.org/pdf/2308.08493;8
|
| 464 |
RadNLI;;GPT-3.5;model;0.0;0.0;0.0;model-based;https://arxiv.org/pdf/2308.08493;8
|
| 465 |
+
|
| 466 |
+
|
| 467 |
+
openai_humaneval;;EleutherAI/pile;corpus;;;12.2;data-based;https://arxiv.org/abs/2403.04811;12
|
| 468 |
+
mbpp;;EleutherAI/pile;corpus;;;3.6;data-based;https://arxiv.org/abs/2403.04811;12
|
| 469 |
+
openai_humaneval;;bigcode/the-stack;corpus;;;18.9;data-based;https://arxiv.org/abs/2403.04811;12
|
| 470 |
+
mbpp;;bigcode/the-stack;corpus;;;20.8;data-based;https://arxiv.org/abs/2403.04811;12
|