aider

mirror of https://github.com/Aider-AI/aider.git synced 2025-05-28 16:25:00 +00:00

History

Paul Gauthier c7ba0a5dba aider: Modified show_stats function to plot a bar graph with the average of "pass_rate_1" for each "edit_format" within each model, grouping the bars for each model together. # Context: USER: don't combine the edit formats. plot a different bar for each model/edit_format combo. But group the bars for each model together. ASSISTANT: To implement this change, we need to modify the `show_stats` function in the `benchmark/benchmark.py` file. We will use the pandas and matplotlib libraries to create the bar plot. Here are the steps: 1. In the `show_stats` function, after creating the DataFrame `df`, sort it based on "model" and "edit_format". 2. Group the DataFrame by "model" and "edit_format" and plot a bar graph for "pass_rate_1". Here are the changes: ```python benchmark/benchmark.py <<<<<<< ORIGINAL def show_stats(dirnames): rows = [] for dirname in dirnames: row = summarize_results(dirname) rows.append(vars(row)) df = pd.DataFrame.from_records(rows) df.sort_values(by=["model", "edit_format"], inplace=True) df_grouped = df.groupby("model")["pass_rate_1"].mean() df_grouped.plot(kind='barh', figsize=(10, 6)) plt.xlabel('Pass Rate 1') plt.ylabel('Model') plt.title('Pass Rate 1 for each Model/Edit Format') plt.show() df.to_csv("tmp.benchmark.csv") ======= def show_stats(dirnames): rows = [] for dirname in dirnames: row = summarize_results(dirname) rows.append(vars(row)) df = pd.DataFrame.from_records(rows) df.sort_values(by=["model", "edit_format"], inplace=True) df_grouped = df.groupby(["model", "edit_format"])["pass_rate_1"].mean() df_grouped.unstack().plot(kind='barh', figsize=(10, 6)) plt.xlabel('Pass Rate 1') plt.ylabel('Model') plt.title('Pass Rate 1 for each Model/Edit Format') plt.legend(title='Edit Format') plt.show() df.to_csv("tmp.benchmark.csv") >>>>>>> UPDATED ``` This will create a bar graph with the average of "pass_rate_1" for each "edit_format" within each model. The bars for each model are grouped together.		2023-06-29 18:52:08 -07:00
..
__init__.py	initial	2023-06-28 18:32:09 -07:00
benchmark.py	aider: Modified show_stats function to plot a bar graph with the average of "pass_rate_1" for each "edit_format" within each model, grouping the bars for each model together.	2023-06-29 18:52:08 -07:00
docker.sh	AIDER_BENCHMARK_DIR	2023-06-29 18:13:54 -07:00
docker_build.sh	docker environment for the whole benchmark	2023-06-29 14:19:00 -07:00
Dockerfile	pull summary data into a df	2023-06-29 18:48:20 -07:00
prompts.py	move benchmark prompts into their own file	2023-06-28 18:25:39 -07:00
test_benchmark.py	remove long ==--- lines to save tokens	2023-06-28 15:44:23 -07:00