mirror of
https://github.com/Aider-AI/aider.git
synced 2025-05-31 01:35:00 +00:00
copy
This commit is contained in:
parent
0d06364db6
commit
b184ab9977
2 changed files with 2 additions and 2 deletions
|
@ -418,7 +418,7 @@ This is contrast to a pass@N result for N>1, where N attempts are made
|
|||
and all N solutions are evaluated by the acceptance tests.
|
||||
If *any* of the N solution pass, that counts as a pass@N success.
|
||||
|
||||
Below are the references for the pass@1 unhinted SWE-Bench results
|
||||
Below are the references for the other pass@1 unhinted SWE-Bench results
|
||||
displayed in the graph at the beginning of this article.
|
||||
|
||||
- [20.3% Amazon Q Developer Agent (v20240430-dev)](https://www.swebench.com)
|
||||
|
|
|
@ -242,7 +242,7 @@ This is contrast to a pass@N result for N>1, where N attempts are made
|
|||
and all N solutions are evaluated by the acceptance tests.
|
||||
If *any* of the N solution pass, that counts as a pass@N success.
|
||||
|
||||
Below are the references for the pass@1 unhinted SWE-Bench results
|
||||
Below are the references for the other pass@1 unhinted SWE-Bench results
|
||||
displayed in the graph at the beginning of this article.
|
||||
|
||||
- [13.9% Devin, benchmarked on 570 instances.](https://www.cognition.ai/post/swe-bench-technical-report)
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue