aider: Refactor cmd_tokens to use main_model.token_count() instead of calling tokenizer directly.

# Aider chat conversation: USER: in `cmd_tokens` make use of main_model.token_count() rather than calling tokenizer directly ASSISTANT: To make this change, we need to modify `aider/commands.py`. Specifically, we will update the `cmd_tokens` method to use `self.coder.main_model.token_count()` instead of calling `self.tokenizer.encode()` directly. Here is the *SEARCH/REPLACE* block for the change: aider/commands.py <source>python <<<<<<< SEARCH tokens = len(self.tokenizer.encode(json.dumps(msgs))) ======= tokens = self.coder.main_model.token_count(json.dumps(msgs)) >>>>>>> REPLACE </source> aider/commands.py <source>python <<<<<<< SEARCH tokens = len(self.tokenizer.encode(msgs)) ======= tokens = self.coder.main_model.token_count(msgs) >>>>>>> REPLACE </source> aider/commands.py <source>python <<<<<<< SEARCH tokens = len(self.tokenizer.encode(repo_content)) ======= tokens = self.coder.main_model.token_count(repo_content) >>>>>>> REPLACE </source> aider/commands.py <source>python <<<<<<< SEARCH tokens = len(self.tokenizer.encode(content)) ======= tokens = self.coder.main_model.token_count(content) >>>>>>> REPLACE </source> These changes will ensure that the `cmd_tokens` method uses the `token_count` method from the `main_model` to calculate the number of tokens, which is a more abstracted and potentially model-specific way of counting tokens.
2025-05-24 22:34:59 +00:00 · 2023-11-14 10:04:00 -08:00 · 2023-11-14 10:04:00 -08:00 · 90f57664a6
commit 90f57664a6
parent c63885fe8c
1 changed files with 4 additions and 4 deletions
--- a/aider/commands.py
+++ b/aider/commands.py
@ -109,7 +109,7 @@ class Commands:
            dict(role="system", content=self.coder.gpt_prompts.main_system),
            dict(role="system", content=self.coder.gpt_prompts.system_reminder),
        ]
-        tokens = len(self.tokenizer.encode(json.dumps(msgs)))
+        tokens = self.coder.main_model.token_count(json.dumps(msgs))
        res.append((tokens, "system messages", ""))

        # chat history
@ -117,7 +117,7 @@ class Commands:
        if msgs:
            msgs = [dict(role="dummy", content=msg) for msg in msgs]
            msgs = json.dumps(msgs)
-            tokens = len(self.tokenizer.encode(msgs))
+            tokens = self.coder.main_model.token_count(msgs)
            res.append((tokens, "chat history", "use /clear to clear"))

        # repo map
@ -125,7 +125,7 @@ class Commands:
        if self.coder.repo_map:
            repo_content = self.coder.repo_map.get_repo_map(self.coder.abs_fnames, other_files)
            if repo_content:
-                tokens = len(self.tokenizer.encode(repo_content))
+                tokens = self.coder.main_model.token_count(repo_content)
                res.append((tokens, "repository map", "use --map-tokens to resize"))

        # files
@ -134,7 +134,7 @@ class Commands:
            content = self.io.read_text(fname)
            # approximate
            content = f"{relative_fname}\n```\n" + content + "```\n"
-            tokens = len(self.tokenizer.encode(content))
+            tokens = self.coder.main_model.token_count(content)
            res.append((tokens, f"{relative_fname}", "use /drop to drop from chat"))

        self.io.tool_output("Approximate context window usage, in tokens:")