This commit is contained in:
Paul Gauthier 2024-06-05 16:25:11 -07:00
parent 9de32c88c3
commit c0700161bc
18 changed files with 68 additions and 4 deletions

View file

@ -4,6 +4,10 @@ excerpt: The new `gpt-4-0125-preview` model is quantiatively lazier at coding th
highlight_image: /assets/benchmarks-0125.jpg
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# The January GPT-4 Turbo is lazier than the last version
[![benchmark results](/assets/benchmarks-0125.svg)](https://aider.chat/assets/benchmarks-0125.svg)

View file

@ -4,6 +4,10 @@ excerpt: A quantitative comparison of the code editing capabilities of the new G
highlight_image: /assets/benchmarks-1106.jpg
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# Code editing benchmarks for OpenAI's "1106" models
[![benchmark results](/assets/benchmarks-1106.svg)](https://aider.chat/assets/benchmarks-1106.svg)

View file

@ -5,6 +5,10 @@ canonical_url: https://aider.chat/2023/11/06/benchmarks-speed-1106.html
highlight_image: /assets/benchmarks-speed-1106.jpg
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# Speed benchmarks of GPT-4 Turbo and gpt-3.5-turbo-1106
<p class="post-date">{{ page.date | date: "%b %-d, %Y" }}</p>

View file

@ -4,6 +4,10 @@ excerpt: Benchmarking GPT-3.5 and GPT-4 code editing skill using a new code edit
highlight_image: /assets/benchmarks.jpg
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# GPT code editing benchmarks
[![benchmark results](/assets/benchmarks.svg)](https://aider.chat/assets/benchmarks.svg)

View file

@ -4,6 +4,10 @@ excerpt: Using ctags to build a "repository map" to increase GPT-4's ability to
highlight_image: /assets/robot-flowchart.png
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# Improving GPT-4's codebase understanding with ctags
![robot flowchat](/assets/robot-flowchart.png)

View file

@ -4,6 +4,10 @@ excerpt: Tree-sitter allows aider to build a repo map that better summarizes lar
highlight_image: /assets/robot-ast.png
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# Building a better repository map with tree sitter
![robot flowchat](/assets/robot-ast.png)

View file

@ -4,6 +4,10 @@ excerpt: GPT-4 Turbo has a problem with lazy coding, which can be signiciantly i
highlight_image: /assets/benchmarks-udiff.jpg
nav_exclude: true
---
{% if page.date %}
<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
{% endif %}
# Unified diffs make GPT-4 Turbo 3X less lazy
![robot flowchart](/assets/benchmarks-udiff.svg)