mirror of https://github.com/Aider-AI/aider.git, synced 2025-06-01 02:05:00 +00:00
parent 4894914db1
commit dc8761763d
2 changed files with 10 additions and 2 deletions
|
@@ -10,6 +10,7 @@ nav_exclude: true
|
||||||
{% endif %}
|
{% endif %}
|
||||||
|
|
||||||
# Quantization matters
|
# Quantization matters
|
||||||
|
{: .no_toc }
|
||||||
|
|
||||||
Open source models like Qwen 2.5 32B Instruct are performing very well on
|
Open source models like Qwen 2.5 32B Instruct are performing very well on
|
||||||
aider's code editing benchmark, rivaling closed source frontier models.
|
aider's code editing benchmark, rivaling closed source frontier models.
|
||||||
|
@@ -18,8 +19,7 @@ can impact code editing skill.
|
||||||
Heavily quantized models are often used by cloud API providers
|
Heavily quantized models are often used by cloud API providers
|
||||||
and local model servers like Ollama or MLX.
|
and local model servers like Ollama or MLX.
|
||||||
|
|
||||||
|
The graph and table below compares different versions of the Qwen 2.5 Coder 32B Instruct model,
|
||||||
The graph above compares different versions of the Qwen 2.5 Coder 32B Instruct model,
|
|
||||||
served both locally and from cloud providers.
|
served both locally and from cloud providers.
|
||||||
|
|
||||||
- The [HuggingFace BF16 weights](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) served via [glhf.chat](https://glhf.chat).
|
- The [HuggingFace BF16 weights](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) served via [glhf.chat](https://glhf.chat).
|
||||||
|
@@ -38,9 +38,17 @@ It's unclear why this is happening to just this provider.
|
||||||
The other providers available through OpenRouter perform similarly
|
The other providers available through OpenRouter perform similarly
|
||||||
when their API is accessed directly.
|
when their API is accessed directly.
|
||||||
|
|
||||||
|
### Sections
|
||||||
|
{: .no_toc }
|
||||||
|
|
||||||
|
- TOC
|
||||||
|
{:toc}
|
||||||
|
|
||||||
{: .note }
|
{: .note }
|
||||||
This article is being updated as additional benchmark runs complete.
|
This article is being updated as additional benchmark runs complete.
|
||||||
|
|
||||||
|
## Benchmark results
|
||||||
|
|
||||||
<canvas id="quantChart" width="800" height="600" style="margin: 20px 0"></canvas>
|
<canvas id="quantChart" width="800" height="600" style="margin: 20px 0"></canvas>
|
||||||
<script src="https://cdn.jsdelivr.net/npm/chart.js"></script>
|
<script src="https://cdn.jsdelivr.net/npm/chart.js"></script>
|
||||||
<script>
|
<script>
|
||||||
|
|
Binary file not shown. Size before: 118 KiB; size after: 148 KiB.