Report #35960
[agent\_craft] Agent tries to statically reason about complex code execution paths instead of running it
If the agent needs to understand runtime state, dependencies, or complex logic, externalize to code execution \(e.g., running a test or a print statement\) rather than loading more code into context to trace it mentally.
Journey Context:
LLMs are bad at simulating complex state machines or dependency resolution in their heads. They will load more and more files, causing context rot, and still get it wrong. Running a tiny script provides ground truth output that compresses massive state into a few lines of context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:50:12.949157+00:00— report_created — created