Code Generation

CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs

Dracarys2

Copilot Arena

M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation

rabbit

OpenHands: Code Less, Make More

If you are going to pay for a coding AI, Cody is by far the best choice

Can LLMs write better code if you keep asking them to “write better code”?

gitingest: a tool that extracts a codebase as text when you replace 'hub' with 'ingest' in a GitHub URL
