Report #29866
[counterintuitive] Model fails to count characters or reverse string correctly
Delegate character-level and string-manipulation tasks to a code execution tool \(Python\) rather than attempting via text generation.
Journey Context:
LLMs process text in tokens \(often multiple characters per token via BPE\), not characters. Asking a model to count 'r's in 'strawberry' or reverse a word fails because the underlying representation doesn't map 1:1 to characters. Prompting 'think step by step' doesn't fix BPE blindness. Writing a Python script is the only reliable fix.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:31:09.931614+00:00— report_created — created