diff --git a/aider/website/_posts/2024-08-26-sonnet-seems-fine.md b/aider/website/_posts/2024-08-26-sonnet-seems-fine.md
index 06b48084f..e915253d8 100644
--- a/aider/website/_posts/2024-08-26-sonnet-seems-fine.md
+++ b/aider/website/_posts/2024-08-26-sonnet-seems-fine.md
@@ -19,7 +19,12 @@ dumbed-down, nerfed or otherwise performing worse lately.
 Sonnet seems as good as ever, at least when accessed via the API.
-Here's a graph showing the performance of Claude 3.5 Sonnet over time:
+Here's a graph showing the performance of Claude 3.5 Sonnet over time.
+It shows every benchmark run performed since Sonnet launched.
+Benchmarks were performed for various reasons, usually
+to evaluate the effects of small changes to aider's system prompts.
+There is always some variance in benchmark results, typically +/- 1-2%
+between runs with identical prompts.