From 63a4ce0fe28c00b8f770c82e4f182aa4d0ce5a9f Mon Sep 17 00:00:00 2001 From: "Paul Gauthier (aider)" Date: Thu, 26 Sep 2024 09:39:16 -0700 Subject: [PATCH] feat: Add a graph that displays the Pass Rate for each Senior/Junior/EditFormat --- .../_posts/2024-09-26-senior-junior.md | 77 ++++++++++++++++++- 1 file changed, 76 insertions(+), 1 deletion(-) diff --git a/aider/website/_posts/2024-09-26-senior-junior.md b/aider/website/_posts/2024-09-26-senior-junior.md index defe27235..05115cb3a 100644 --- a/aider/website/_posts/2024-09-26-senior-junior.md +++ b/aider/website/_posts/2024-09-26-senior-junior.md @@ -42,10 +42,85 @@ pair programming AI coding experience. ## Results -The graph above and table below show the +The graph below and table show the [aider's code editing benchmark](/docs/benchmarks.html#the-benchmark) score for various combinations of Senior and Junior models. +
+ +
+ + + + Some noteworthy observations: - o1-preview with Deepseek as the Junior surprises as the SOTA result, beating other stronger Junior models. This result is obtained with Deepseek using the "whole" editing format, requiring it to output a full update copy of each edited source file. This is quite slow, and so probably not practical for interactive use with aider.