CodeEval is an innovative, pedagogy-based benchmarking dataset that targeted evaluation of code-trained LLMs. It assesses LLMs across 27 distinct aspects of Python programming at three proficiency ...
You can create a release to package software, along with release notes and links to binary files, for other people to use. Learn more about releases in our docs.