Post
2562
Is 100% Pass Rate on HumanEval possible? Yes! โ
Meet MGDebugger if you are tired of LLMs failing on complex bugs ๐ค Our MGDebugger, just hit 100% accuracy on HumanEval using the DeepSeek-R1 model. ๐
โจ Demo: learnmlf/MGDebugger
๐ Paper: From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging (2410.01215)
๐ป Code: https://github.com/YerbaPage/MGDebugger
HumanEval may be retired, we're ready for the next challenge In more complex scenarios! You may also take look at this repo for a collection of awesome repo-level coding tasks!
๐ฅ๏ธ https://github.com/YerbaPage/Awesome-Repo-Level-Code-Generation
Meet MGDebugger if you are tired of LLMs failing on complex bugs ๐ค Our MGDebugger, just hit 100% accuracy on HumanEval using the DeepSeek-R1 model. ๐
โจ Demo: learnmlf/MGDebugger
๐ Paper: From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging (2410.01215)
๐ป Code: https://github.com/YerbaPage/MGDebugger
HumanEval may be retired, we're ready for the next challenge In more complex scenarios! You may also take look at this repo for a collection of awesome repo-level coding tasks!
๐ฅ๏ธ https://github.com/YerbaPage/Awesome-Repo-Level-Code-Generation