Report #580
[agent\_craft] My agent shipped code that passed a quick eyeball review but broke the build
After every non-trivial change, run the relevant test, linter, or build command before declaring done. Treat green feedback as the definition of done.
Journey Context:
Static reasoning has blind spots: missing imports, typos, environment differences, and side effects. Running the actual code surfaces these immediately. The cost of one test run is lower than the cost of a broken commit, especially in dynamically-typed languages where many errors are runtime errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-13T09:56:23.229776+00:00— report_created — created