mirror of
https://github.com/Aider-AI/aider.git
synced 2025-05-29 08:44:59 +00:00
copy
This commit is contained in:
parent
fcc62ebffc
commit
26edbcc8f1
4 changed files with 55 additions and 50 deletions
|
@ -32,7 +32,8 @@ for more details on the data presented in this chart.
|
||||||
## Interactive, not agentic
|
## Interactive, not agentic
|
||||||
|
|
||||||
Aider achieved this result mainly through its existing features that focus on static
|
Aider achieved this result mainly through its existing features that focus on static
|
||||||
code analysis, reliable LLM code editing, and pragmatic UX for AI pair programming.
|
code analysis, reliable LLM code editing, and pragmatic UX for automatically
|
||||||
|
fixing linting and testing errors.
|
||||||
Aider intentionally has quite limited and narrow "agentic behavior"
|
Aider intentionally has quite limited and narrow "agentic behavior"
|
||||||
to avoid long delays, high token costs
|
to avoid long delays, high token costs
|
||||||
and the need for users to repeatedly code review incorrect solutions.
|
and the need for users to repeatedly code review incorrect solutions.
|
||||||
|
@ -203,20 +204,20 @@ The table below breaks down the benchmark outcome of each problem,
|
||||||
showing whether aider with GPT-4o and with Opus
|
showing whether aider with GPT-4o and with Opus
|
||||||
produced plausible and/or correct solutions.
|
produced plausible and/or correct solutions.
|
||||||
|
|
||||||
|Row|Aider<br>w/GPT-4o<br>solution<br>plausible?|Aider<br>w/GPT-4o<br>solution<br>resolved<br>issue?|Aider<br>w/Opus<br>solution<br>plausible?|Aider<br>w/Opus<br>solution<br>resolved<br>issue?|Number of<br>problems<br>with this<br>outcome|
|
|Row|Aider<br>w/GPT-4o<br>solution<br>plausible?|Aider<br>w/GPT-4o<br>solution<br>resolved<br>issue?|Aider<br>w/Opus<br>solution<br>plausible?|Aider<br>w/Opus<br>solution<br>resolved<br>issue?|Number of<br>problems<br>with this<br>outcome|Number of<br>problems<br>resolved|
|
||||||
|:--:|--:|--:|--:|--:|--:|
|
|:--:|--:|--:|--:|--:|--:|--:|
|
||||||
| A | plausible | resolved | n/a | n/a | 73 |
|
| A | **plausible** | **resolved** | n/a | n/a | 73 | 73 |
|
||||||
| B | plausible | not resolved | n/a | n/a | 181 |
|
| B | **plausible** | not resolved | n/a | n/a | 181 | 0 |
|
||||||
| C | non-plausible | resolved | plausible | resolved | 1 |
|
| C | non-plausible | **resolved** | **plausible** | **resolved** | 1 | 1 |
|
||||||
| D | non-plausible | resolved | plausible | not resolved | 2 |
|
| D | non-plausible | **resolved** | **plausible** | not resolved | 2 | 0 |
|
||||||
| E | non-plausible | resolved | non-plausible | resolved | 16 |
|
| E | non-plausible | **resolved** | non-plausible | **resolved** | 16 | 16 |
|
||||||
| F | non-plausible | resolved | non-plausible | not resolved | 5 |
|
| F | non-plausible | **resolved** | non-plausible | not resolved | 5 | 3 |
|
||||||
| G | non-plausible | not resolved | non-plausible | resolved | 4 |
|
| G | non-plausible | not resolved | non-plausible | **resolved** | 4 | 2 |
|
||||||
| H | non-plausible | not resolved | non-plausible | not resolved | 216 |
|
| H | non-plausible | not resolved | non-plausible | not resolved | 216 | 0 |
|
||||||
| I | non-plausible | not resolved | plausible | resolved | 12 |
|
| I | non-plausible | not resolved | **plausible** | **resolved** | 12 | 12 |
|
||||||
| J | non-plausible | not resolved | plausible | not resolved | 53 |
|
| J | non-plausible | not resolved | **plausible** | not resolved | 53 | 0 |
|
||||||
| K | non-plausible | not resolved | n/a | n/a | 7 |
|
| K | non-plausible | not resolved | n/a | n/a | 7 | 0 |
|
||||||
|Total|||||570|
|
|Total|||||570|107|
|
||||||
|
|
||||||
Rows A-B show the cases where
|
Rows A-B show the cases where
|
||||||
aider with GPT-4o found a plausible solution during the first attempt.
|
aider with GPT-4o found a plausible solution during the first attempt.
|
||||||
|
|
Binary file not shown.
Before Width: | Height: | Size: 42 KiB After Width: | Height: | Size: 43 KiB |
|
@ -6,7 +6,7 @@
|
||||||
<rdf:RDF xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:cc="http://creativecommons.org/ns#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
|
<rdf:RDF xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:cc="http://creativecommons.org/ns#" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">
|
||||||
<cc:Work>
|
<cc:Work>
|
||||||
<dc:type rdf:resource="http://purl.org/dc/dcmitype/StillImage"/>
|
<dc:type rdf:resource="http://purl.org/dc/dcmitype/StillImage"/>
|
||||||
<dc:date>2024-06-01T07:33:17.926838</dc:date>
|
<dc:date>2024-06-01T11:25:56.978629</dc:date>
|
||||||
<dc:format>image/svg+xml</dc:format>
|
<dc:format>image/svg+xml</dc:format>
|
||||||
<dc:creator>
|
<dc:creator>
|
||||||
<cc:Agent>
|
<cc:Agent>
|
||||||
|
@ -41,12 +41,12 @@ z
|
||||||
<g id="xtick_1">
|
<g id="xtick_1">
|
||||||
<g id="line2d_1">
|
<g id="line2d_1">
|
||||||
<defs>
|
<defs>
|
||||||
<path id="m71f437ae6a" d="M 0 0
|
<path id="m0820b1788b" d="M 0 0
|
||||||
L 0 3.5
|
L 0 3.5
|
||||||
" style="stroke: #000000; stroke-width: 0.8"/>
|
" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</defs>
|
</defs>
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="137.644385" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="137.644385" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_1">
|
<g id="text_1">
|
||||||
|
@ -412,7 +412,7 @@ z
|
||||||
<g id="xtick_2">
|
<g id="xtick_2">
|
||||||
<g id="line2d_2">
|
<g id="line2d_2">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="219.596257" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="219.596257" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_2">
|
<g id="text_2">
|
||||||
|
@ -583,7 +583,7 @@ z
|
||||||
<g id="xtick_3">
|
<g id="xtick_3">
|
||||||
<g id="line2d_3">
|
<g id="line2d_3">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="301.548128" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="301.548128" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_3">
|
<g id="text_3">
|
||||||
|
@ -699,7 +699,7 @@ z
|
||||||
<g id="xtick_4">
|
<g id="xtick_4">
|
||||||
<g id="line2d_4">
|
<g id="line2d_4">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="383.5" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="383.5" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_4">
|
<g id="text_4">
|
||||||
|
@ -894,7 +894,7 @@ z
|
||||||
<g id="xtick_5">
|
<g id="xtick_5">
|
||||||
<g id="line2d_5">
|
<g id="line2d_5">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="465.451872" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="465.451872" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_5">
|
<g id="text_5">
|
||||||
|
@ -926,7 +926,7 @@ z
|
||||||
<g id="xtick_6">
|
<g id="xtick_6">
|
||||||
<g id="line2d_6">
|
<g id="line2d_6">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="547.403743" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="547.403743" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_6">
|
<g id="text_6">
|
||||||
|
@ -1157,7 +1157,7 @@ z
|
||||||
<g id="xtick_7">
|
<g id="xtick_7">
|
||||||
<g id="line2d_7">
|
<g id="line2d_7">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m71f437ae6a" x="629.355615" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m0820b1788b" x="629.355615" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_7">
|
<g id="text_7">
|
||||||
|
@ -1339,16 +1339,16 @@ z
|
||||||
<g id="line2d_8">
|
<g id="line2d_8">
|
||||||
<path d="M 77 307.664
|
<path d="M 77 307.664
|
||||||
L 690 307.664
|
L 690 307.664
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_9">
|
<g id="line2d_9">
|
||||||
<defs>
|
<defs>
|
||||||
<path id="m402042e2ce" d="M 0 0
|
<path id="m57e939663f" d="M 0 0
|
||||||
L -3.5 0
|
L -3.5 0
|
||||||
" style="stroke: #000000; stroke-width: 0.8"/>
|
" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</defs>
|
</defs>
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="307.664" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_8">
|
<g id="text_8">
|
||||||
|
@ -1394,11 +1394,11 @@ z
|
||||||
<g id="line2d_10">
|
<g id="line2d_10">
|
||||||
<path d="M 77 275.08244
|
<path d="M 77 275.08244
|
||||||
L 690 275.08244
|
L 690 275.08244
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_11">
|
<g id="line2d_11">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="275.08244" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="275.08244" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_9">
|
<g id="text_9">
|
||||||
|
@ -1467,11 +1467,11 @@ z
|
||||||
<g id="line2d_12">
|
<g id="line2d_12">
|
||||||
<path d="M 77 242.500879
|
<path d="M 77 242.500879
|
||||||
L 690 242.500879
|
L 690 242.500879
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_13">
|
<g id="line2d_13">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="242.500879" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="242.500879" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_10">
|
<g id="text_10">
|
||||||
|
@ -1487,11 +1487,11 @@ L 690 242.500879
|
||||||
<g id="line2d_14">
|
<g id="line2d_14">
|
||||||
<path d="M 77 209.919319
|
<path d="M 77 209.919319
|
||||||
L 690 209.919319
|
L 690 209.919319
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_15">
|
<g id="line2d_15">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="209.919319" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="209.919319" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_11">
|
<g id="text_11">
|
||||||
|
@ -1523,11 +1523,11 @@ z
|
||||||
<g id="line2d_16">
|
<g id="line2d_16">
|
||||||
<path d="M 77 177.337759
|
<path d="M 77 177.337759
|
||||||
L 690 177.337759
|
L 690 177.337759
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_17">
|
<g id="line2d_17">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="177.337759" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="177.337759" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_12">
|
<g id="text_12">
|
||||||
|
@ -1557,11 +1557,11 @@ z
|
||||||
<g id="line2d_18">
|
<g id="line2d_18">
|
||||||
<path d="M 77 144.756199
|
<path d="M 77 144.756199
|
||||||
L 690 144.756199
|
L 690 144.756199
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_19">
|
<g id="line2d_19">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="144.756199" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="144.756199" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_13">
|
<g id="text_13">
|
||||||
|
@ -1578,11 +1578,11 @@ L 690 144.756199
|
||||||
<g id="line2d_20">
|
<g id="line2d_20">
|
||||||
<path d="M 77 112.174638
|
<path d="M 77 112.174638
|
||||||
L 690 112.174638
|
L 690 112.174638
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_21">
|
<g id="line2d_21">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="112.174638" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="112.174638" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_14">
|
<g id="text_14">
|
||||||
|
@ -1599,11 +1599,11 @@ L 690 112.174638
|
||||||
<g id="line2d_22">
|
<g id="line2d_22">
|
||||||
<path d="M 77 79.593078
|
<path d="M 77 79.593078
|
||||||
L 690 79.593078
|
L 690 79.593078
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
" clip-path="url(#pf392d01723)" style="fill: none; stroke: #b0b0b0; stroke-width: 0.2; stroke-linecap: square"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="line2d_23">
|
<g id="line2d_23">
|
||||||
<g>
|
<g>
|
||||||
<use xlink:href="#m402042e2ce" x="77" y="79.593078" style="stroke: #000000; stroke-width: 0.8"/>
|
<use xlink:href="#m57e939663f" x="77" y="79.593078" style="stroke: #000000; stroke-width: 0.8"/>
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_15">
|
<g id="text_15">
|
||||||
|
@ -1780,7 +1780,7 @@ L 170.425134 307.664
|
||||||
L 170.425134 170.821447
|
L 170.425134 170.821447
|
||||||
L 104.863636 170.821447
|
L 104.863636 170.821447
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #b3d1e6; opacity: 0.3"/>
|
" clip-path="url(#pf392d01723)" style="fill: #b3e6a8; opacity: 0.3"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="patch_8">
|
<g id="patch_8">
|
||||||
<path d="M 186.815508 307.664
|
<path d="M 186.815508 307.664
|
||||||
|
@ -1788,7 +1788,7 @@ L 252.377005 307.664
|
||||||
L 252.377005 169.518184
|
L 252.377005 169.518184
|
||||||
L 186.815508 169.518184
|
L 186.815508 169.518184
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #b3d1e6; opacity: 0.3"/>
|
" clip-path="url(#pf392d01723)" style="fill: #b3e6a8; opacity: 0.3"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="patch_9">
|
<g id="patch_9">
|
||||||
<path d="M 268.76738 307.664
|
<path d="M 268.76738 307.664
|
||||||
|
@ -1796,7 +1796,7 @@ L 334.328877 307.664
|
||||||
L 334.328877 144.756199
|
L 334.328877 144.756199
|
||||||
L 268.76738 144.756199
|
L 268.76738 144.756199
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #b3d1e6; opacity: 0.3"/>
|
" clip-path="url(#pf392d01723)" style="fill: #b3e6a8; opacity: 0.3"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="patch_10">
|
<g id="patch_10">
|
||||||
<path d="M 350.719251 307.664
|
<path d="M 350.719251 307.664
|
||||||
|
@ -1804,7 +1804,7 @@ L 416.280749 307.664
|
||||||
L 416.280749 127.813787
|
L 416.280749 127.813787
|
||||||
L 350.719251 127.813787
|
L 350.719251 127.813787
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #b3d1e6; opacity: 0.3"/>
|
" clip-path="url(#pf392d01723)" style="fill: #b3e6a8; opacity: 0.3"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="patch_11">
|
<g id="patch_11">
|
||||||
<path d="M 432.671123 307.664
|
<path d="M 432.671123 307.664
|
||||||
|
@ -1812,7 +1812,7 @@ L 498.23262 307.664
|
||||||
L 498.23262 126.510525
|
L 498.23262 126.510525
|
||||||
L 432.671123 126.510525
|
L 432.671123 126.510525
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #b3d1e6; opacity: 0.3"/>
|
" clip-path="url(#pf392d01723)" style="fill: #b3e6a8; opacity: 0.3"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="patch_12">
|
<g id="patch_12">
|
||||||
<path d="M 514.622995 307.664
|
<path d="M 514.622995 307.664
|
||||||
|
@ -1820,7 +1820,7 @@ L 580.184492 307.664
|
||||||
L 580.184492 86.10939
|
L 580.184492 86.10939
|
||||||
L 514.622995 86.10939
|
L 514.622995 86.10939
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #17965a; opacity: 0.9"/>
|
" clip-path="url(#pf392d01723)" style="fill: #155f91; opacity: 0.9"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="patch_13">
|
<g id="patch_13">
|
||||||
<path d="M 596.574866 307.664
|
<path d="M 596.574866 307.664
|
||||||
|
@ -1828,7 +1828,7 @@ L 662.136364 307.664
|
||||||
L 662.136364 62.650667
|
L 662.136364 62.650667
|
||||||
L 596.574866 62.650667
|
L 596.574866 62.650667
|
||||||
z
|
z
|
||||||
" clip-path="url(#p73b7253dcf)" style="fill: #17965a; opacity: 0.9"/>
|
" clip-path="url(#pf392d01723)" style="fill: #155f91; opacity: 0.9"/>
|
||||||
</g>
|
</g>
|
||||||
<g id="text_17">
|
<g id="text_17">
|
||||||
<!-- 10.5% -->
|
<!-- 10.5% -->
|
||||||
|
@ -2356,7 +2356,7 @@ z
|
||||||
</g>
|
</g>
|
||||||
</g>
|
</g>
|
||||||
<defs>
|
<defs>
|
||||||
<clipPath id="p73b7253dcf">
|
<clipPath id="pf392d01723">
|
||||||
<rect x="77" y="50.4" width="613" height="257.264"/>
|
<rect x="77" y="50.4" width="613" height="257.264"/>
|
||||||
</clipPath>
|
</clipPath>
|
||||||
</defs>
|
</defs>
|
||||||
|
|
Before Width: | Height: | Size: 57 KiB After Width: | Height: | Size: 57 KiB |
|
@ -51,7 +51,11 @@ def plot_swe_bench(data_file, is_lite):
|
||||||
spine.set_edgecolor("#DDDDDD")
|
spine.set_edgecolor("#DDDDDD")
|
||||||
spine.set_linewidth(0.5)
|
spine.set_linewidth(0.5)
|
||||||
|
|
||||||
colors = ["#17965A" if "Aider" in model else "#b3d1e6" for model in models]
|
if is_lite:
|
||||||
|
colors = ["#17965A" if "Aider" in model else "#b3d1e6" for model in models]
|
||||||
|
else:
|
||||||
|
colors = ["#155F91" if "Aider" in model else "#b3e6a8" for model in models]
|
||||||
|
|
||||||
bars = []
|
bars = []
|
||||||
for model, pass_rate, color in zip(models, pass_rates, colors):
|
for model, pass_rate, color in zip(models, pass_rates, colors):
|
||||||
alpha = 0.9 if "Aider" in model else 0.3
|
alpha = 0.9 if "Aider" in model else 0.3
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue