Merge remote-tracking branch 'origin/main'

This commit is contained in:
Paul Gauthier 2024-05-07 06:27:20 -07:00
commit ca0faf7cc7
2 changed files with 2 additions and 2 deletions

View file

@ -42,7 +42,7 @@
total_cost: 0.0000
- dirname: 2024-04-29-19-17-28--deepseek-coder-whole
test_cases: 132
model: openai/deepseek-coder
model: deepseek-coder
edit_format: whole
commit_hash: c07f793-dirty
pass_rate_1: 47.0

View file

@ -8,7 +8,7 @@ highlight_image: /assets/leaderboard.jpg
Aider works best with LLMs which are good at *editing* code, not just good at writing
code.
To evaluate an LLM's editing skill, aider uses a pair of benchmarks that
assess their ability to consistently follow the system instructions
assess a model's ability to consistently follow the system prompt
to successfully edit code.
The leaderboards below report the results from a number of popular LLMs.