mirror of https://github.com/Aider-AI/aider.git (synced 2025-06-03 03:05:00 +00:00)
parent 8a8f3936f4
commit 0d06364db6
2 changed files with 4 additions and 4 deletions

@@ -15,7 +15,7 @@ from Amazon Q Developer Agent.
 
 [](https://aider.chat/assets/swe_bench_lite.svg)
 
-**To be clear, all of aider's results reported here are pass@1 results,
+**All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.**
 All results in the above chart are unhinted pass@1 results.
 Please see the [references](#references)
@@ -407,7 +407,7 @@ making it faster, easier, and more reliable to run the acceptance tests.
 
 ## References
 
-To be clear, all of aider's results reported here are pass@1 results,
+All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.
 
 The "aider agent" internally makes multiple "attempts" at solving the problem,

@@ -20,7 +20,7 @@ This result on the main SWE Bench builds on
 
 [](https://aider.chat/assets/swe_bench.svg)
 
-**To be clear, all of aider's results reported here are pass@1 results,
+**All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.**
 Aider was benchmarked on the same
 [570 randomly selected SWE Bench problems](https://github.com/CognitionAI/devin-swebench-results/tree/main/output_diffs)
@@ -231,7 +231,7 @@ making it faster, easier, and more reliable to run the acceptance tests.
 
 ## References
 
-To be clear, all of aider's results reported here are pass@1 results,
+All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.
 
 The "aider agent" internally makes multiple "attempts" at solving the problem,