aider/benchmark/swe-bench.txt
Paul Gauthier 2cb9a8ddc8 copy
2024-06-01 16:10:55 -07:00

7 lines
209 B
Text

18.9% Aider|GPT-4o|& Opus|(570)
17.0% Aider|GPT-4o|(570)
13.9% Devin|(570)
13.8% Amazon Q|Developer|Agent|(2,294)
12.5% SWE-|Agent|+ GPT-4|(2,294)
10.6% Auto|Code|Rover|(2,294)
10.5% SWE-|Agent|+ Opus|(2,294)