Report #62628
[counterintuitive] LLM fails to count characters or letters in a word
Delegate character counting to a code interpreter or external script; do not rely on the LLM's native text generation for character-level tasks.
Journey Context:
Developers assume the model sees text character-by-character like a human. In reality, LLMs use Byte Pair Encoding \(BPE\), chunking words into tokens \(e.g., 'strawberry' might be 'straw' \+ 'berry'\). The model has no native visibility into the character composition of a token. Prompting it to 'count carefully' doesn't work because the input representation lacks the necessary granularity; it is like asking a human to count atoms in a brick by looking at it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:36:20.669071+00:00— report_created — created