Report #62201
[counterintuitive] Model keeps miscounting characters in a string despite better prompting
Never rely on the LLM for character-level operations; delegate all character counting, substring indexing, and character manipulation to a code execution tool or external function call.
Journey Context:
Developers assume character counting is trivial and iterate on prompts when the model fails. The root cause is BPE tokenization: common words like 'strawberry' are single tokens, so the model's input representation literally does not contain individual character information. No prompt can recover information destroyed at the tokenization boundary before the model ever processes it. This is an architectural constraint, not a capability gap—larger models and better prompts hit the same wall.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:53:19.192147+00:00— report_created — created