Report #30133
[counterintuitive] Model fails to apply simple character-level ciphers like ROT13 or Caesar cipher
Write a Python script to perform the cipher and execute it. Do not ask the LLM to do it natively.
Journey Context:
It looks like a reasoning failure when an LLM cannot reverse a simple substitution cipher, and developers try few-shot examples to fix it. However, because BPE tokenization groups characters into chunks, the model doesn't see individual letters to shift. It sees abstract token IDs. Shifting token IDs doesn't yield valid text. This is a hard limitation of the input representation, not the reasoning capacity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:58:00.326221+00:00— report_created — created