mirror of
https://github.com/Aider-AI/aider.git
synced 2025-05-30 17:24:59 +00:00
copy
This commit is contained in:
parent
8c74c8ab6f
commit
7f0860d5d0
2 changed files with 5 additions and 6 deletions
|
@ -188,7 +188,7 @@ on GitHub.
|
||||||
<script>
|
<script>
|
||||||
{% assign data_source = edit_sorted %}
|
{% assign data_source = edit_sorted %}
|
||||||
{% assign pass_rate_field = "pass_rate_2" %}
|
{% assign pass_rate_field = "pass_rate_2" %}
|
||||||
{% assign highlight_model = "o1" %}
|
{% assign highlight_model = "o1-2024" %}
|
||||||
{% include leaderboard.js %}
|
{% include leaderboard.js %}
|
||||||
</script>
|
</script>
|
||||||
<style>
|
<style>
|
||||||
|
|
|
@ -7,8 +7,7 @@ This directory holds the harness and tools needed to run the benchmarking suite.
|
||||||
|
|
||||||
## Background
|
## Background
|
||||||
|
|
||||||
The benchmark is based on the [Exercism
|
The benchmark is based on the [Exercism](https://github.com/exercism/python) coding exercises.
|
||||||
python](https://github.com/exercism/python) coding exercises.
|
|
||||||
This
|
This
|
||||||
benchmark evaluates how effectively aider and GPT can translate a
|
benchmark evaluates how effectively aider and GPT can translate a
|
||||||
natural language coding request into executable code saved into
|
natural language coding request into executable code saved into
|
||||||
|
@ -18,7 +17,7 @@ GPT's coding ability, but also its capacity to *edit existing code*
|
||||||
and *format those code edits* so that aider can save the
|
and *format those code edits* so that aider can save the
|
||||||
edits to the local source files.
|
edits to the local source files.
|
||||||
|
|
||||||
See [this writeup for a longer discussion about the benchmark and how to interpret the results](https://aider.chat/docs/benchmarks.html).
|
See [this writeup for a longer discussion about the benchmark](https://aider.chat/2024/12/21/polyglot.html)
|
||||||
|
|
||||||
The benchmark is intended to be run *inside a docker container*.
|
The benchmark is intended to be run *inside a docker container*.
|
||||||
This is because the benchmarking harness will be
|
This is because the benchmarking harness will be
|
||||||
|
@ -50,8 +49,8 @@ git clone git@github.com:Aider-AI/aider.git
|
||||||
cd aider
|
cd aider
|
||||||
mkdir tmp.benchmarks
|
mkdir tmp.benchmarks
|
||||||
|
|
||||||
# Clone the exercism repo
|
# Clone the repo with the exercises
|
||||||
git clone git@github.com:exercism/python.git
|
|
||||||
|
|
||||||
# Copy the practice exercises into the benchmark scratch dir
|
# Copy the practice exercises into the benchmark scratch dir
|
||||||
cp -rp python/exercises/practice tmp.benchmarks/exercism-python
|
cp -rp python/exercises/practice tmp.benchmarks/exercism-python
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue