This commit is contained in:
Paul Gauthier 2024-12-22 16:14:33 -05:00
parent b1bc2f8c5c
commit f55181e447

View file

@ -17,11 +17,11 @@ nav_exclude: true
OpenAI's new o1 model with "high" reasoning effort OpenAI's new o1 model with "high" reasoning effort
gets the top score on the gets the top score on the
new new
[aider polyglot leaderboard](/docs/leaderboard/), significantly ahead of [aider polyglot leaderboard](/docs/leaderboards/), significantly ahead of
other top LLMs. other top LLMs.
The new polyglot benchmark was designed to be The new polyglot benchmark was designed to be
*much more challenging* than aider's old *much more challenging* than aider's old
[code editing benchmark](/docs/leaderboard/edit.html). [code editing benchmark](/docs/leaderboards/edit.html).
This more clearly distinguishes This more clearly distinguishes
the performance of the performance of
today's strongest coding models and today's strongest coding models and