mirror of
https://github.com/Aider-AI/aider.git
synced 2025-05-28 16:25:00 +00:00
copy
This commit is contained in:
parent
8c74c8ab6f
commit
7f0860d5d0
2 changed files with 5 additions and 6 deletions
|
@ -188,7 +188,7 @@ on GitHub.
|
|||
<script>
|
||||
{% assign data_source = edit_sorted %}
|
||||
{% assign pass_rate_field = "pass_rate_2" %}
|
||||
{% assign highlight_model = "o1" %}
|
||||
{% assign highlight_model = "o1-2024" %}
|
||||
{% include leaderboard.js %}
|
||||
</script>
|
||||
<style>
|
||||
|
|
|
@ -7,8 +7,7 @@ This directory holds the harness and tools needed to run the benchmarking suite.
|
|||
|
||||
## Background
|
||||
|
||||
The benchmark is based on the [Exercism
|
||||
python](https://github.com/exercism/python) coding exercises.
|
||||
The benchmark is based on the [Exercism](https://github.com/exercism/python) coding exercises.
|
||||
This
|
||||
benchmark evaluates how effectively aider and GPT can translate a
|
||||
natural language coding request into executable code saved into
|
||||
|
@ -18,7 +17,7 @@ GPT's coding ability, but also its capacity to *edit existing code*
|
|||
and *format those code edits* so that aider can save the
|
||||
edits to the local source files.
|
||||
|
||||
See [this writeup for a longer discussion about the benchmark and how to interpret the results](https://aider.chat/docs/benchmarks.html).
|
||||
See [this writeup for a longer discussion about the benchmark](https://aider.chat/2024/12/21/polyglot.html)
|
||||
|
||||
The benchmark is intended to be run *inside a docker container*.
|
||||
This is because the benchmarking harness will be
|
||||
|
@ -50,8 +49,8 @@ git clone git@github.com:Aider-AI/aider.git
|
|||
cd aider
|
||||
mkdir tmp.benchmarks
|
||||
|
||||
# Clone the exercism repo
|
||||
git clone git@github.com:exercism/python.git
|
||||
# Clone the repo with the exercises
|
||||
|
||||
|
||||
# Copy the practice exercises into the benchmark scratch dir
|
||||
cp -rp python/exercises/practice tmp.benchmarks/exercism-python
|
||||
|
|
Loading…
Add table
Add a link
Reference in a new issue