--- highlight_image: /assets/leaderboard.jpg nav_order: 950 description: Quantitative benchmarks of LLM code editing skill. has_children: true --- # Aider LLM Leaderboards Aider excels with LLMs skilled at writing and *editing* code, and uses benchmarks to evaluate an LLM's ability to follow instructions and edit code successfully without human intervention. [Aider's polyglot benchmark](https://aider.chat/2024/12/21/polyglot.html#the-polyglot-benchmark) tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust. ## Aider polyglot coding leaderboard
Model | Percent correct | Cost (log scale) | Command | Correct edit format | Edit Format | |
---|---|---|---|---|---|---|
{{ row.model }} | {{ row.pass_rate_2 }}% | {% if row.total_cost > 0 %} {% endif %} {% assign rounded_cost = row.total_cost | times: 1.0 | round: 2 %} {% if row.total_cost == 0 or rounded_cost == 0.00 %}?{% else %}${{ rounded_cost }}{% endif %} | {{ row.command }} |
{{ row.percent_cases_well_formed }}% | {{ row.edit_format }} |
By Paul Gauthier, last updated April 12, 2025.