mirror of https://github.com/Aider-AI/aider.git (synced 2025-06-04 03:35:00 +00:00)
Merge remote-tracking branch 'origin/main'
commit ca0faf7cc7
2 changed files with 2 additions and 2 deletions
@@ -42,7 +42,7 @@
 total_cost: 0.0000
 - dirname: 2024-04-29-19-17-28--deepseek-coder-whole
 test_cases: 132
-model: openai/deepseek-coder
+model: deepseek-coder
 edit_format: whole
 commit_hash: c07f793-dirty
 pass_rate_1: 47.0
@@ -8,7 +8,7 @@ highlight_image: /assets/leaderboard.jpg
 Aider works best with LLMs which are good at *editing* code, not just good at writing
 code.
 To evaluate an LLM's editing skill, aider uses a pair of benchmarks that
-assess their ability to consistently follow the system instructions
+assess a model's ability to consistently follow the system prompt
 to successfully edit code.
 
 The leaderboards below report the results from a number of popular LLMs.