Report #100752
[agent\_craft] Multi-file refactor loses track and repeats or contradicts earlier steps
Maintain an explicit todo/dependency graph in context and update it after each tool call. Do not start a downstream step until its prerequisites are verified by an external observation.
Journey Context:
Without structured working memory, agents wander: they fix the same file twice, edit callers before interfaces, or claim a step is done when it failed silently. A visible todo graph acts as a state machine and gives the model a place to record verified facts versus plans. The anti-pattern is implicit memory—assuming the model remembers what it did ten turns ago. Externalizing state to a scratchpad is simpler than a database and keeps it inspectable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:02:25.921095+00:00— report_created — created