---
highlight_image: /assets/leaderboard.jpg
nav_order: 950
description: Quantitative benchmarks of LLM code editing skill.
has_children: true
---

# Aider LLM Leaderboards

Aider excels with LLMs skilled at writing and *editing* code, and uses benchmarks to evaluate an LLM's ability to follow instructions and edit code successfully without human intervention. [Aider's polyglot benchmark](https://aider.chat/2024/12/21/polyglot.html#the-polyglot-benchmark) tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust.

## Aider polyglot coding leaderboard
{% assign max_cost = 0 %}
{% for row in site.data.polyglot_leaderboard %}
  {% if row.total_cost > max_cost %}
    {% assign max_cost = row.total_cost %}
  {% endif %}
{% endfor %}
{% if max_cost == 0 %}{% assign max_cost = 1 %}{% endif %}
{% assign edit_sorted = site.data.polyglot_leaderboard | sort: 'pass_rate_2' | reverse %}

<table>
  <thead>
    <tr>
      <th>Model</th>
      <th>Percent correct</th>
      <th>Cost (log scale)</th>
      <th>Command</th>
      <th>Correct edit format</th>
      <th>Edit Format</th>
    </tr>
  </thead>
  <tbody>
    {% for row in edit_sorted %}
      {% comment %} Add loop index for unique IDs {% endcomment %}
      {% assign row_index = forloop.index0 %}
      {% assign rounded_cost = row.total_cost | times: 1.0 | round: 2 %}
      <tr>
        <td>{{ row.model }}</td>
        <td>{{ row.pass_rate_2 }}%</td>
        <td>{% if row.total_cost == 0 or rounded_cost == 0.00 %}?{% else %}${{ rounded_cost }}{% endif %}</td>
        <td><code>{{ row.command }}</code></td>
        <td>{{ row.percent_cases_well_formed }}%</td>
        <td>{{ row.edit_format }}</td>
      </tr>
    {% endfor %}
  </tbody>
</table>
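
The ordering and cost-display rules used by the template can be sketched outside Liquid. The Python below is an illustrative approximation, not the site's build code: the function names and the sample rows are hypothetical, and only the field names `pass_rate_2` and `total_cost` come from the template. It sorts rows by `pass_rate_2` in descending order and shows `?` whenever the cost is zero or rounds to zero at two decimal places.

```python
def format_cost(total_cost):
    """Mirror the template's cost cell: '?' for unknown or negligible cost."""
    rounded = round(float(total_cost), 2)
    if total_cost == 0 or rounded == 0.0:
        return "?"
    return f"${rounded}"


def leaderboard_rows(rows):
    """Sort ascending by pass_rate_2, then reverse, as the Liquid filters do."""
    return list(reversed(sorted(rows, key=lambda r: r["pass_rate_2"])))


# Hypothetical sample data, for illustration only (not real benchmark results).
sample = [
    {"model": "model-a", "pass_rate_2": 40.0, "total_cost": 0},
    {"model": "model-b", "pass_rate_2": 72.9, "total_cost": 18.5834},
    {"model": "model-c", "pass_rate_2": 55.1, "total_cost": 0.004},
]

for row in leaderboard_rows(sample):
    print(row["model"], f'{row["pass_rate_2"]}%', format_cost(row["total_cost"]))
```

A cost of `0` is treated as "not recorded" rather than "free", which is why it is shown as `?` instead of `$0.0`.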

By Paul Gauthier, last updated April 12, 2025.