aider/benchmark/swe_bench_lite.py
Paul Gauthier 5761218b9a aider: Add imgcat to display the graph after saving it.
# Aider chat conversation:

USER: save it as a png and svg and imgcat it out

ASSISTANT: To save the graph as a PNG and SVG and then display it using `imgcat`, we need to modify `benchmark/swe_bench_lite.py`:

1. Import `imgcat`.
2. Add `imgcat` to display the graph after saving it.

Here are the *SEARCH/REPLACE* blocks:

benchmark/swe_bench_lite.py
```python
<<<<<<< SEARCH
from matplotlib import rc
=======
from matplotlib import rc
from imgcat import imgcat
>>>>>>> REPLACE
```

benchmark/swe_bench_lite.py
```python
<<<<<<< SEARCH
    plt.show()
=======
    imgcat(fig)
    plt.show()
>>>>>>> REPLACE
```
2024-05-22 15:13:01 -07:00

49 lines
1.5 KiB
Python

import matplotlib.pyplot as plt
from matplotlib import rc
from imgcat import imgcat
def plot_swe_bench_lite(data_file):
with open(data_file, "r") as file:
lines = file.readlines()
models = []
pass_rates = []
for line in lines:
if line.strip():
pass_rate, model = line.split("%")
models.append(model.strip())
pass_rates.append(float(pass_rate.strip()))
plt.rcParams["hatch.linewidth"] = 0.5
plt.rcParams["hatch.color"] = "#444444"
rc("font", **{"family": "sans-serif", "sans-serif": ["Helvetica"], "size": 10})
fig, ax = plt.subplots(figsize=(10, 5))
ax.grid(axis="y", zorder=0, lw=0.2)
for spine in ax.spines.values():
spine.set_edgecolor("#DDDDDD")
spine.set_linewidth(0.5)
colors = [
"red" if "Aider" in model else "blue" for model in models
]
bars = ax.bar(models, pass_rates, color=colors, alpha=0.5, zorder=3)
for bar in bars:
yval = bar.get_height()
ax.text(bar.get_x() + bar.get_width()/2, yval + 0.5, f'{yval}%', ha='center', va='bottom', fontsize=12, alpha=0.75)
ax.set_xlabel("Models", fontsize=18)
ax.set_ylabel("Pass Rate (%)", fontsize=18)
ax.set_title("SWE Bench Lite pass rates", fontsize=20)
plt.xticks(rotation=45, ha='right')
plt.tight_layout()
plt.savefig("swe_bench_lite.png")
plt.savefig("swe_bench_lite.svg")
imgcat(fig)
plt.show()
# Example usage
plot_swe_bench_lite("benchmark/tmp.txt")