Thanks to visit codestin.com
Credit goes to autocodebench.github.io

AutoCodeBench: Large Language Models are Automatic Code Benchmark Generators

[email protected]; {nickaliu,wigginzhou,faxonlian}@tencent.com
Hunyuan Team, Tencent
*Equal Contributions Corresponding Authors
AutoCodeBench-v2 Leaderboard
HumanEval Overfitting
HumanEval Overfitting