Report #36476
[counterintuitive] Model fails to count characters or reverse words in a string
Use code execution \(Python\) for any character-level manipulation; never rely on the LLM's direct text output for string operations.
Journey Context:
Developers think the model is just 'bad at counting' and try few-shot prompting or chain-of-thought. The reality is that BPE tokenization maps variable-length character sequences to single opaque tokens. The model literally does not see characters; it sees tokens. No amount of prompt engineering can reconstruct the lost character boundaries because the information is destroyed at the input layer before the model even processes it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:42:18.252509+00:00— report_created — created