From 6d75dc62d79c04611e2d915f9b957e13a1bc6332 Mon Sep 17 00:00:00 2001 From: Paul Gauthier Date: Mon, 13 May 2024 11:06:02 -0700 Subject: [PATCH] copy --- docs/leaderboards/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/leaderboards/index.md b/docs/leaderboards/index.md index 1de2f7f79..95772d03e 100644 --- a/docs/leaderboards/index.md +++ b/docs/leaderboards/index.md @@ -17,7 +17,7 @@ it works best with models that score well on the benchmarks. ## GPT-4o -GPT-4o tops the aider LLM code editing leaderboard at 72.9%, versus 68.4% for Opus. GPT-4o takes second on aider's refactoring leaderboard with XX, versus Opus at 72.3%. +GPT-4o tops the aider LLM code editing leaderboard at 72.9%, versus 68.4% for Opus. GPT-4o takes second on aider's refactoring leaderboard with 62.9%, versus Opus at 72.3%. GPT-4o did much better than the 4-turbo models, and seems *much* less lazy.