BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18 • 39
Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning Paper • 2410.03103 • Published Oct 4 • 6
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published Jul 1 • 34
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22 • 45
đŸŒ¸BigCodeBench Collection Benchmarking Code Generation with Diverse Function Calls and Complex Instructions https://bigcode-bench.github.io/ • 7 items • Updated Jul 18 • 2
view article Article BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18 • 39
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order Paper • 2404.00399 • Published Mar 30 • 41
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models Paper • 2401.00788 • Published Jan 1 • 21
Pop Quiz! Do Pre-trained Code Models Possess Knowledge of Correct API Names? Paper • 2309.07804 • Published Sep 14, 2023 • 2
Large Language Models Are State-of-the-Art Evaluators of Code Generation Paper • 2304.14317 • Published Apr 27, 2023 • 2
Source Code Data Augmentation for Deep Learning: A Survey Paper • 2305.19915 • Published May 31, 2023 • 1
OctoPack: Instruction Tuning Code Large Language Models Paper • 2308.07124 • Published Aug 14, 2023 • 28