mirror of
https://github.com/Aider-AI/aider.git
synced 2025-06-01 02:05:00 +00:00
22 KiB
22 KiB
highlight_image | nav_order | description | has_children |
---|---|---|---|
/assets/leaderboard.jpg | 950 | Quantitative benchmarks of LLM code editing skill. | true |
Aider LLM Leaderboards
Aider excels with LLMs skilled at editing code, not just writing it. These benchmarks evaluate an LLM's ability to follow instructions and edit code successfully without human intervention. Aider works best with high-scoring models, though it can connect to almost any LLM.
Polyglot leaderboard
Aider's polyglot benchmark tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust.
Model | Percent correct | Cost (log scale) | Command | |
---|---|---|---|---|
{{ row.model }} | {{ row.pass_rate_2 }}% | {% if row.total_cost > 0 %} {% endif %} {% assign rounded_cost = row.total_cost | times: 1.0 | round: 2 %} {% if row.total_cost == 0 or rounded_cost == 0.00 %}?{% else %}${{ rounded_cost }}{% endif %} | {{ row.command }} |
|
|
By Paul Gauthier, last updated