mirror of
https://github.com/Aider-AI/aider.git
synced 2025-06-05 04:05:04 +00:00
copy
parent
8a8f3936f4
commit
0d06364db6
2 changed files with 4 additions and 4 deletions
@@ -15,7 +15,7 @@ from Amazon Q Developer Agent.

 [](https://aider.chat/assets/swe_bench_lite.svg)

-**To be clear, all of aider's results reported here are pass@1 results,
+**All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.**
 All results in the above chart are unhinted pass@1 results.
 Please see the [references](#references)
@@ -407,7 +407,7 @@ making it faster, easier, and more reliable to run the acceptance tests.

 ## References

-To be clear, all of aider's results reported here are pass@1 results,
+All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.

 The "aider agent" internally makes multiple "attempts" at solving the problem,

@@ -20,7 +20,7 @@ This result on the main SWE Bench builds on

 [](https://aider.chat/assets/swe_bench.svg)

-**To be clear, all of aider's results reported here are pass@1 results,
+**All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.**
 Aider was benchmarked on the same
 [570 randomly selected SWE Bench problems](https://github.com/CognitionAI/devin-swebench-results/tree/main/output_diffs)
@@ -231,7 +231,7 @@ making it faster, easier, and more reliable to run the acceptance tests.

 ## References

-To be clear, all of aider's results reported here are pass@1 results,
+All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.

 The "aider agent" internally makes multiple "attempts" at solving the problem,