mirror of
https://github.com/Aider-AI/aider.git
synced 2025-05-29 08:44:59 +00:00
parent 090e0cdcfe
commit e5e07f9507
2 changed files with 4 additions and 3 deletions
@@ -630,7 +630,7 @@
 indentation_errors: 0
 exhausted_context_windows: 0
 test_timeouts: 0
-command: aider --model openrouter/anthropic/claude-3.5-sonnet
+command: aider --model openrouter/anthropic/claude-3.5-sonnet --edit-format whole
 date: 2024-06-20
 versions: 0.38.1-dev
 seconds_per_case: 15.4
@@ -638,7 +638,7 @@
 
 - dirname: 2024-06-20-15-16-41--claude-3.5-sonnet-diff
 test_cases: 133
-model: openrouter/anthropic/claude-3.5-sonnet
+model: claude-3.5-sonnet (diff)
 edit_format: diff
 commit_hash: 068609e-dirty
 pass_rate_1: 57.9

@@ -19,13 +19,14 @@ it works best with models that score well on the benchmarks.
 ## Claude 3.5 Sonnet takes the top spot
 
 Claude 3.5 Sonnet is now the top ranked model on aider's code editing leaderboard.
-DeepSeek Coder V2 previously took the #1 spot, only 4 days ago.
+DeepSeek Coder V2 took the #1 spot only 4 days ago.
 
 Sonnet ranked #1 when using the "whole" editing format,
 but it also scored very well with
 aider's "diff" editing format.
 This format allows it to return code changes as diffs -- saving time and token costs,
 and making it practical to work with larger source files.
+As such, aider uses "diff" by default with this new Sonnet model.
 
 ## Code editing leaderboard
 