Report #1086
[agent\_craft] Editing code and not running it to verify
After every meaningful change, run the relevant tests, linter, type checker, or a minimal reproduction. A change without verification is speculative.
Journey Context:
Agents produce syntactically plausible code that may still fail due to typos, import drift, schema mismatches, or logic errors. The natural shortcut is to declare victory after the edit lands, but that externalizes the cost of verification to the user. Running tests also catches environmental issues the model cannot see, such as missing dependencies or permission problems. The tradeoff is time, but a fast failing test is cheaper than a broken deploy. In greenfield work, write a small script or unit test and execute it. In brownfield work, reuse existing test commands.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T17:53:09.836736+00:00— report_created — created