From 8cffb975d9e5c762e8850fde3f95fe081cfa6f7d Mon Sep 17 00:00:00 2001 From: Paul Gauthier Date: Sun, 13 Apr 2025 09:47:32 -0700 Subject: [PATCH] docs: Clarify Aider's LLM evaluation criteria and add polyglot link --- aider/website/docs/leaderboards/index.md | 10 ++++------ 1 file changed, 4 insertions(+), 6 deletions(-) diff --git a/aider/website/docs/leaderboards/index.md b/aider/website/docs/leaderboards/index.md index 0b08705ee..6065d360c 100644 --- a/aider/website/docs/leaderboards/index.md +++ b/aider/website/docs/leaderboards/index.md @@ -8,16 +8,14 @@ has_children: true # Aider LLM Leaderboards -Aider excels with LLMs skilled at *editing* code, not just writing it. -These benchmarks evaluate an LLM's ability to follow instructions and edit code successfully without +Aider excels with LLMs skilled at writing and *editing* code, +and uses benchmarks to +evaluate an LLM's ability to follow instructions and edit code successfully without human intervention. -Aider works best with high-scoring models, though it [can connect to almost any LLM](/docs/llms.html). - +[Aider's polyglot benchmark](https://aider.chat/2024/12/21/polyglot.html#the-polyglot-benchmark) tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust. ## Aider polyglot coding leaderboard -[Aider's polyglot benchmark](https://aider.chat/2024/12/21/polyglot.html#the-polyglot-benchmark) tests LLMs on 225 challenging Exercism coding exercises across C++, Go, Java, JavaScript, Python, and Rust. -