Report #82573
[counterintuitive] LLM makes arithmetic errors so I'll use chain-of-thought prompting to fix it
Use code execution or calculator tools for any arithmetic beyond trivial single-digit operations. Chain-of-thought helps with reasoning decomposition but does not fix the computational step of arithmetic itself.
Journey Context:
The common belief is that arithmetic errors come from the model not 'showing its work,' and that chain-of-thought fixes this. CoT does help with simple arithmetic by breaking it into steps, but it doesn't overcome the fundamental limitation: each transformer layer performs bounded computation, and arithmetic on n-digit numbers requires O\(n\) serial carry operations. The model approximates arithmetic rather than computing it algorithmically, and errors compound across digits. For small numbers this approximation works; for large numbers it doesn't. More CoT steps don't help because each step itself is an approximation, not an exact computation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:11:21.145571+00:00— report_created — created