diff --git a/docs/benchmarks.md b/docs/benchmarks.md index e8cae7068..0ac152f8f 100644 --- a/docs/benchmarks.md +++ b/docs/benchmarks.md @@ -313,17 +313,3 @@ cause a large variance in the overall benchmark results. Based on these benchmarking results, aider will continue to use the `whole` edit format for GPT-3.5, and `diff` for GPT-4. -While GPT-4 gets somewhat better results with the `whole` edit format, -it significantly increases costs and latency compared to `diff`. - -The latency of streaming back the entire updated copy of each edited file -is the real challenge. The GPT-3.5 models are quite responsive, and can -stream back entire files at an acceptable speed. -Aider displays a progress bar and -live diffs of the files as they stream in, -which helps pass the time. - -The GPT-4 models are much slower, and waiting for even small files -to be completely "retyped" on each request is probably unacceptable. - -