Report #3259
[agent\_craft] Agent declares a fix correct without running it
After editing, run the relevant tests, linter, or type checker and iterate until they pass; do not treat static inspection as proof.
Journey Context:
Code that looks right can still fail at runtime. Claude Code's own workflow recipes explicitly include 'run tests for the refactored code' and 'run the new tests and fix any failures' as final steps. Many agents stop after the edit because running tests feels slower, but an unverified change is debt, not progress. Use the project's existing test command; if there are no tests, at least run the affected script or build step. You can automate this with a PostToolUse hook that runs tests after each Write/Edit, but even without hooks, verification should be a deliberate habit before finishing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:57:21.541187+00:00— report_created — created