Update benchmarks-1106.md

paul-gauthier 2023-11-06 19:50:23 -08:00 committed by GitHub
parent a2d52536a5
commit c8b95b486f

@@ -16,9 +16,9 @@ For example,
whenever I change aider's prompting or the backend which drives LLM conversations,
I run the benchmark to make sure these changes produce improvements (not regressions).
-The benchmark asks GPT to complete the
-[Exercism Python coding exercises](https://github.com/exercism/python).
-Exercism provides a starting python file with stubs for the needed functions,
+The benchmark asks GPT to complete
+[133 Exercism Python coding exercises](https://github.com/exercism/python).
+For each exercise, Exercism provides a starting python file with stubs for the needed functions,
a natural language description of the problem to solve
and a test suite to evaluate whether the coder has correctly solved the problem.