Mirror of https://github.com/Aider-AI/aider.git, synced 2025-06-01 02:05:00 +00:00
highlight_image | nav_order | description | has_children |
---|---|---|---|
/assets/leaderboard.jpg | 950 | Quantitative benchmarks of LLM code editing skill. | true |
Aider LLM Leaderboards
Aider works best with LLMs that are skilled at writing and editing code. It uses benchmarks to evaluate an LLM's ability to follow instructions and edit code successfully without human intervention. Aider's polyglot benchmark tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust.
Aider polyglot coding leaderboard
Model | Percent correct | Cost (log scale) | Command | Correct edit format | Edit Format |
---|---|---|---|---|---|
{{ row.model }} | {{ row.pass_rate_2 }}% | {% if row.total_cost > 0 %} {% endif %} {% assign rounded_cost = row.total_cost | times: 1.0 | round: 2 %} {% if row.total_cost == 0 or rounded_cost == 0.00 %}?{% else %}${{ rounded_cost }}{% endif %} | {{ row.command }} | {{ row.percent_cases_well_formed }}% | {{ row.edit_format }} |
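The cost cell in the table template rounds `row.total_cost` to two decimal places and prints `?` when the cost is zero or rounds to zero. As a rough illustration only, the same rule can be sketched in Python. The function name `format_cost` is hypothetical and not part of Aider, and Python's `round` can differ from Liquid's on exact half-way values.

```python
def format_cost(total_cost: float) -> str:
    """Mimic the cost cell of the leaderboard template (illustrative sketch).

    `format_cost` is a hypothetical helper, not an Aider function.
    """
    # Equivalent of: row.total_cost | times: 1.0 | round: 2
    rounded_cost = round(total_cost * 1.0, 2)

    # Unknown or negligible costs are displayed as "?"
    if total_cost == 0 or rounded_cost == 0.00:
        return "?"

    # Otherwise show the rounded dollar amount
    return f"${rounded_cost}"


if __name__ == "__main__":
    for cost in (0, 0.004, 12.34):
        print(cost, "->", format_cost(cost))
```

A cost that is missing from the results data would need separate handling; this sketch assumes a numeric value is always present.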
By Paul Gauthier, last updated