Paul Gauthier 2024-11-21 13:07:18 -08:00
parent e1b4571fdf
commit 3cfbaa0ed6

@@ -28,7 +28,7 @@ The graph above compares 4 different versions of the Qwen 2.5 32B model,
 served both locally and from cloud providers.
 - The [HuggingFace weights](https://huggingface.co/Qwen/Qwen2.5-Coder-32B-Instruct) served via [glhf.chat](https://glhf.chat).
-- The results from [OpenRouter's mix of providers](https://openrouter.ai/qwen/qwen-2.5-coder-32b-instruct/providers).
+- The results from [OpenRouter's mix of providers](https://openrouter.ai/qwen/qwen-2.5-coder-32b-instruct/providers) which serve the model with different levels of quantization.
 - Two Ollama models run locally.
 The best version of the model rivals GPT-4o, while the worst performer
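
The added line notes that OpenRouter routes requests to providers serving the model at different quantization levels. For reference only, the endpoint linked above is OpenAI-compatible, so a run against it could look like the minimal sketch below; the `openai` client, the `https://openrouter.ai/api/v1` base URL, and the environment variable name are assumptions, with only the model slug taken from the URL in the diff.

```python
# Minimal sketch of querying the OpenRouter endpoint referenced in the diff.
# Assumptions: the `openai` Python client, OpenRouter's OpenAI-compatible API
# at https://openrouter.ai/api/v1, and an OPENROUTER_API_KEY environment variable.
import os

from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

# Model slug taken from the providers URL shown in the diff above.
response = client.chat.completions.create(
    model="qwen/qwen-2.5-coder-32b-instruct",
    messages=[{"role": "user", "content": "Write a Python hello world."}],
)
print(response.choices[0].message.content)
```

Which underlying provider (and therefore which quantization) handles such a request is decided by OpenRouter's routing, which is the variability the edited sentence is calling out.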