codegen_eval - a drgitt Collection

drgitt 's Collections

Interesting LLMs

codegen_eval

updated Jun 1, 2024

nuprl/MultiPL-E

Viewer • Updated Jul 15, 2025 • 12.7k • 58.2k • 67
openai/openai_humaneval

Viewer • Updated Jan 4, 2024 • 164 • 293k • 389
Running

Agents

1.51k

Big Code Models Leaderboard

📈

1.51k

Explore and compare code model performance on a leaderboard
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming

Paper • 2402.14261 • Published Feb 22, 2024 • 10
facebook/deit-base-patch16-224

Image Classification • Updated Jul 13, 2022 • 34.9k • • 15