mirror of
https://github.com/Aider-AI/aider.git
synced 2025-05-31 01:35:00 +00:00
copy
This commit is contained in:
parent
2cb9a8ddc8
commit
941456d586
1 changed files with 5 additions and 6 deletions
|
@ -17,8 +17,7 @@ The best result reported elsewhere seems to be
|
||||||
[13.9% from Devin](https://www.cognition.ai/post/swe-bench-technical-report).
|
[13.9% from Devin](https://www.cognition.ai/post/swe-bench-technical-report).
|
||||||
|
|
||||||
This result on the main SWE Bench is in addition to
|
This result on the main SWE Bench is in addition to
|
||||||
[aider's SOTA result on the easier SWE Bench Lite](https://aider.chat/2024/05/22/swe-bench-lite.html)
|
[aider's recent SOTA result on the easier SWE Bench Lite](https://aider.chat/2024/05/22/swe-bench-lite.html).
|
||||||
that was reported recently.
|
|
||||||
|
|
||||||
[](https://aider.chat/assets/swe_bench.svg)
|
[](https://aider.chat/assets/swe_bench.svg)
|
||||||
|
|
||||||
|
@ -106,9 +105,9 @@ and to use pytest to run tests.
|
||||||
- `aider --yes --test-cmd pytest`
|
- `aider --yes --test-cmd pytest`
|
||||||
- They could start the chat by pasting in the URL or text of a GitHub issue.
|
- They could start the chat by pasting in the URL or text of a GitHub issue.
|
||||||
Aider will pull in the URL's content and then try and resolve the issue.
|
Aider will pull in the URL's content and then try and resolve the issue.
|
||||||
- If aider doesn't produce code that lints and tests clean, the user might decide to revert the changes and try again, maybe using aider with a different LLM this time.
|
- If aider doesn't produce code that lints and tests clean, the user might decide to
|
||||||
[Aider is tightly integrated with git](https://aider.chat/docs/faq.html#how-does-aider-use-git),
|
[use git to revert the changes](https://aider.chat/docs/faq.html#how-does-aider-use-git),
|
||||||
so it's always easy to revert AI changes that don't pan out.
|
and try again with `aider --opus`. Many aider users employ this strategy.
|
||||||
|
|
||||||
## Aider with GPT-4o alone was SOTA
|
## Aider with GPT-4o alone was SOTA
|
||||||
|
|
||||||
|
@ -146,7 +145,7 @@ verified as correctly resolving their issue.
|
||||||
|
|
||||||
## Non-plausible but correct solutions?
|
## Non-plausible but correct solutions?
|
||||||
|
|
||||||
A solution doesn't have to be plausible in order to correctly resolve the issue.
|
A solution doesn't actually have to be plausible in order to correctly resolve the issue.
|
||||||
Recall that plausible is simply defined as aider
|
Recall that plausible is simply defined as aider
|
||||||
reporting that it successfully completed all file edits,
|
reporting that it successfully completed all file edits,
|
||||||
repaired and resolved any linting errors
|
repaired and resolved any linting errors
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue