This commit is contained in:
Paul Gauthier 2024-06-20 08:29:16 -07:00
parent 090e0cdcfe
commit e5e07f9507
2 changed files with 4 additions and 3 deletions

View file

@ -630,7 +630,7 @@
indentation_errors: 0 indentation_errors: 0
exhausted_context_windows: 0 exhausted_context_windows: 0
test_timeouts: 0 test_timeouts: 0
command: aider --model openrouter/anthropic/claude-3.5-sonnet command: aider --model openrouter/anthropic/claude-3.5-sonnet --edit-format whole
date: 2024-06-20 date: 2024-06-20
versions: 0.38.1-dev versions: 0.38.1-dev
seconds_per_case: 15.4 seconds_per_case: 15.4
@ -638,7 +638,7 @@
- dirname: 2024-06-20-15-16-41--claude-3.5-sonnet-diff - dirname: 2024-06-20-15-16-41--claude-3.5-sonnet-diff
test_cases: 133 test_cases: 133
model: openrouter/anthropic/claude-3.5-sonnet model: claude-3.5-sonnet (diff)
edit_format: diff edit_format: diff
commit_hash: 068609e-dirty commit_hash: 068609e-dirty
pass_rate_1: 57.9 pass_rate_1: 57.9

View file

@ -19,13 +19,14 @@ it works best with models that score well on the benchmarks.
## Claude 3.5 Sonnet takes the top spot ## Claude 3.5 Sonnet takes the top spot
Claude 3.5 Sonnet is now the top ranked model on aider's code editing leaderboard. Claude 3.5 Sonnet is now the top ranked model on aider's code editing leaderboard.
DeepSeek Coder V2 previously took the #1 spot, only 4 days ago. DeepSeek Coder V2 took the #1 spot only 4 days ago.
Sonnet ranked #1 when using the "whole" editing format, Sonnet ranked #1 when using the "whole" editing format,
but it also scored very well with but it also scored very well with
aider's "diff" editing format. aider's "diff" editing format.
This format allows it to return code changes as diffs -- saving time and token costs, This format allows it to return code changes as diffs -- saving time and token costs,
and making it practical to work with larger source files. and making it practical to work with larger source files.
As such, aider uses "diff" by default with this new Sonnet model.
## Code editing leaderboard ## Code editing leaderboard