This commit is contained in:
Paul Gauthier 2024-05-31 08:16:56 -07:00
parent aa1e34982d
commit c76215f302

View file

@ -67,6 +67,8 @@ The held out "acceptance tests" were *only* used
after benchmarking to compute statistics on which problems aider
correctly resolved.
The [full harness to run aider on SWE Bench Lite is available on GitHub](https://github.com/paul-gauthier/aider-swe-bench).
The benchmarking process was similar to how a developer might use aider to
resolve a GitHub issue: