From 7a12ca90d4a4bacdea5c132dbfade7c4b86fc43c Mon Sep 17 00:00:00 2001
From: Paul Gauthier
Date: Thu, 26 Sep 2024 12:53:04 -0700
Subject: [PATCH] copy

---
 .../_posts/2024-09-26-senior-junior.md | 27 +++++++++----------
 1 file changed, 13 insertions(+), 14 deletions(-)

diff --git a/aider/website/_posts/2024-09-26-senior-junior.md b/aider/website/_posts/2024-09-26-senior-junior.md
index 8217a850c..4264dd29f 100644
--- a/aider/website/_posts/2024-09-26-senior-junior.md
+++ b/aider/website/_posts/2024-09-26-senior-junior.md
@@ -13,15 +13,13 @@ nav_exclude: true
 
 Aider now has experimental support for using two models to complete each coding task:
 
-- A Senior model is asked to describe how to solve the coding problem in detail.
+- A Senior model is asked to describe how to solve the coding problem.
 - A Junior model is given the Senior's solution and asked to produce specific code editing instructions to apply those changes to source files.
 
 Splitting up "code reasoning" and "code editing" has produced SOTA results on
 [aider's code editing benchmark](/docs/benchmarks.html#the-benchmark).
-Both Sonnet and o1-preview exceed the previous SOTA when using this
-new Senior/Junior approach.
-The best result was obtained with
-o1-preview as Senior and Deepseek as Junior, raising the SOTA from 79.7% up to 85%!
+It also significantly improved the benchmark scores of four of the
+top coding models, as compared to their previous "solo" scores (striped bars).