diff --git a/docs/benchmarks-1106.md b/docs/benchmarks-1106.md index 838936ed7..aaa105019 100644 --- a/docs/benchmarks-1106.md +++ b/docs/benchmarks-1106.md @@ -17,7 +17,7 @@ whenever I change aider's prompting or the backend which drives LLM conversation I run the benchmark to make sure these changes produce improvements (not regressions). The benchmark asks GPT to complete the -[Exercism Python coding exercises](). +[Exercism Python coding exercises](https://github.com/exercism/python). Exercism provides a starting python file with stubs for the needed functions, a natural language description of the problem to solve and a test suite to evaluate whether the coder has correctly solved the problem.