diff --git a/aider/website/_posts/2024-08-26-sonnet-seems-fine.md b/aider/website/_posts/2024-08-26-sonnet-seems-fine.md index c9c877d91..fc77e8e6e 100644 --- a/aider/website/_posts/2024-08-26-sonnet-seems-fine.md +++ b/aider/website/_posts/2024-08-26-sonnet-seems-fine.md @@ -17,4 +17,67 @@ There has been a lot of speculation that Sonnet has been dumbed-down, nerfed or otherwise performing worse lately. Sonnet seems as good as ever. +Here's a graph showing the performance of Claude 3.5 Sonnet over time: + +
+ +
+ + + + +This graph shows the performance of Claude 3.5 Sonnet on the aider code editing benchmark over time. 'Pass Rate 1' represents the initial success rate, while 'Pass Rate 2' shows the success rate after a second attempt. As you can see, there's no significant decline in performance, suggesting that Sonnet's capabilities have remained stable since its launch. +