mirror of
https://github.com/Aider-AI/aider.git
synced 2025-06-14 08:34:59 +00:00
copy
This commit is contained in:
parent
2ea5a98ee0
commit
ec44850646
4 changed files with 177 additions and 37 deletions
|
@ -6,7 +6,7 @@ description: Quantitative benchmark of LLM code refactoring skill.
|
|||
---
|
||||
|
||||
|
||||
## Aider refactoring leaderboard
|
||||
## Refactoring leaderboard
|
||||
|
||||
[Aider's refactoring benchmark](https://github.com/Aider-AI/refactor-benchmark) asks the LLM to refactor 89 large methods from large python classes. This is a more challenging benchmark, which tests the model's ability to output long chunks of code without skipping sections or making mistakes. It was developed to provoke and measure [GPT-4 Turbo's "lazy coding" habit](/2023/12/21/unified-diffs.html).
|
||||
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue