init

2025-05-28 16:25:00 +00:00 · 2024-11-21 10:55:20 -08:00 · 2024-11-21 10:55:20 -08:00 · 8eda09533d
commit 8eda09533d
parent 6c42ee4edf
2 changed files with 27 additions and 0 deletions
--- a/aider/website/_posts/2024-11-21-quantization.md
+++ b/aider/website/_posts/2024-11-21-quantization.md
@ -0,0 +1,27 @@
+---
+title: Quantization matters
+excerpt: Open source LLMs are becoming very powerful, but pay attention to how you (or your) provider is quantizing the model. It strongly affects code editing skill.
+highlight_image: /assets/quantization.jpg
+draft: false
+nav_exclude: true
+---
+{% if page.date %}
+<p class="post-date">{{ page.date | date: "%B %d, %Y" }}</p>
+{% endif %}
+
+# Quantization matters
+
+Open source models like Qwen 2.5 32B are performing very well on
+aider's code editing benchmark, rivaling closed source frontier models.
+But pay attention to how your model is being quantized, as it
+can strongly impact code editing skill.
+Heavily quantized models are often used by cloud API providers
+and local model servers like ollama.
+
+The graph below compares 4 different versions of the Qwen 2.5 32B model,
+served both locally and from cloud providers:
+
+- Qwen2.5-Coder-32B-Instruct
+- ollama/qwen2.5:32b
+- ollama/qwen2.5:32b-instruct-q8_0
+- openrouter/qwen/qwen-2.5-coder-32b-instruct
--- a/aider/website/assets/quantization.jpg
+++ b/aider/website/assets/quantization.jpg