copy

2025-06-03 03:05:00 +00:00 · 2024-06-03 11:14:17 -07:00 · 2024-06-03 11:14:17 -07:00 · 0d06364db6
commit 0d06364db6
parent 8a8f3936f4
2 changed files with 4 additions and 4 deletions
--- a/_posts/2024-05-22-swe-bench-lite.md
+++ b/_posts/2024-05-22-swe-bench-lite.md
@ -15,7 +15,7 @@ from Amazon Q Developer Agent.

 [![SWE Bench Lite results](/assets/swe_bench_lite.svg)](https://aider.chat/assets/swe_bench_lite.svg)

-**To be clear, all of aider's results reported here are pass@1 results,
+**All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.**
 All results in the above chart are unhinted pass@1 results.
 Please see the [references](#references)
@ -407,7 +407,7 @@ making it faster, easier, and more reliable to run the acceptance tests.

 ## References

-To be clear, all of aider's results reported here are pass@1 results,
+All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.

 The "aider agent" internally makes multiple "attempts" at solving the problem,
--- a/_posts/2024-06-02-main-swe-bench.md
+++ b/_posts/2024-06-02-main-swe-bench.md
@ -20,7 +20,7 @@ This result on the main SWE Bench builds on

 [![SWE Bench results](/assets/swe_bench.svg)](https://aider.chat/assets/swe_bench.svg)

-**To be clear, all of aider's results reported here are pass@1 results,
+**All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.**
 Aider was benchmarked on the same
 [570 randomly selected SWE Bench problems](https://github.com/CognitionAI/devin-swebench-results/tree/main/output_diffs)
@ -231,7 +231,7 @@ making it faster, easier, and more reliable to run the acceptance tests.

 ## References

-To be clear, all of aider's results reported here are pass@1 results,
+All of aider's results reported here are pass@1 results,
 obtained without using the SWE Bench `hints_text`.

 The "aider agent" internally makes multiple "attempts" at solving the problem,