Report #13747
[agent\_craft] Agent uses LLM reasoning to trace code execution paths or calculate complex state instead of running the code
Force the agent to externalize state tracking and path resolution to the runtime \(e.g., using print statements, debuggers, or unit tests\) rather than simulating execution in its head.
Journey Context:
LLMs are bad at simulating code execution, especially across multiple files or with complex state mutations. They hallucinate variable values. The context window should be used for intent and observations, not as a register for virtual machines. Running a test and reading the traceback is high-signal; trying to mentally trace the code is low-signal and error-prone.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T19:42:11.280536+00:00— report_created — created