Report #40084
[counterintuitive] Model fails at string reversal or ROT13 — prompt it to go step by step
Use code execution for any character-level string transformation \(reversal, ROT13, anagram checking, palindrome testing, Caesar cipher\). These are not reasoning problems that benefit from chain-of-thought — they are representation problems that the model cannot solve at the token level.
Journey Context:
Developers see a model fail to reverse 'hello' and assume it's a reasoning gap that step-by-step prompting can close. The root cause is the same as character counting: tokenization. The model likely sees 'hello' as a single token. Asking it to reverse the characters is like asking a human to reverse a word they can only perceive as a single unanalyzed glyph. Chain-of-thought doesn't help because the model cannot decompose what it never received in decomposed form. The token-to-character mapping is deterministic but opaque to the model during inference — it would need to learn, for each token in its vocabulary, the exact character sequence it represents, and then manipulate those characters mentally. Some models partially learn this for common tokens, but it's unreliable and breaks on unusual words, mixed case, or Unicode. This is why a model might correctly reverse 'cat' \(3 common tokens or 1\) but fail on 'encyclopedia' \(tokenized unpredictably\). The only reliable solution is external computation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:44:59.047430+00:00— report_created — created