From 8816d22591d96ac98f254c504906b295e13a33bd Mon Sep 17 00:00:00 2001 From: "Paul Gauthier (aider)" Date: Mon, 26 Aug 2024 20:25:34 -0700 Subject: [PATCH] feat: Add line graph to visualize Sonnet performance over time --- .../_posts/2024-08-26-sonnet-seems-fine.md | 63 +++++++++++++++++++ 1 file changed, 63 insertions(+) diff --git a/aider/website/_posts/2024-08-26-sonnet-seems-fine.md b/aider/website/_posts/2024-08-26-sonnet-seems-fine.md index c9c877d91..fc77e8e6e 100644 --- a/aider/website/_posts/2024-08-26-sonnet-seems-fine.md +++ b/aider/website/_posts/2024-08-26-sonnet-seems-fine.md @@ -17,4 +17,67 @@ There has been a lot of speculation that Sonnet has been dumbed-down, nerfed or otherwise performing worse lately. Sonnet seems as good as ever. +Here's a graph showing the performance of Claude 3.5 Sonnet over time: + +
+ +
+ + + + +This graph shows the performance of Claude 3.5 Sonnet on the aider code editing benchmark over time. 'Pass Rate 1' represents the initial success rate, while 'Pass Rate 2' shows the success rate after a second attempt. As you can see, there's no significant decline in performance, suggesting that Sonnet's capabilities have remained stable since its launch. +