Paul Gauthier 2024-12-22 16:14:33 -05:00
parent b1bc2f8c5c
commit f55181e447

@@ -17,11 +17,11 @@ nav_exclude: true
OpenAI's new o1 model with "high" reasoning effort
gets the top score on the
new
-[aider polyglot leaderboard](/docs/leaderboard/), significantly ahead of
+[aider polyglot leaderboard](/docs/leaderboards/), significantly ahead of
other top LLMs.
The new polyglot benchmark was designed to be
*much more challenging* than aider's old
-[code editing benchmark](/docs/leaderboard/edit.html).
+[code editing benchmark](/docs/leaderboards/edit.html).
This more clearly distinguishes
the performance of
today's strongest coding models and