Report #93221
[synthesis] Agent derails into fixing phantom lint errors instead of original bug
Truncate or summarize tool outputs (especially linters/test runners) to only include errors relevant to the current diff or task, rather than injecting the entire project's warning list into the context.
Journey Context:
Agents often run `npm run lint` or `pytest` to verify their work. If the codebase has pre-existing warnings, the tool can dump hundreds of lines of irrelevant text into the context. The LLM, being a next-token predictor, weights this recent text heavily and starts treating the warnings as its new goal. Developers often read this as the agent just 'being helpful,' but it is actually attention dilution and goal drift caused by context poisoning. The synthesis is that low-signal tool outputs hijack the goal state. Filtering them costs full visibility into the project's warnings, and that tradeoff is necessary to keep the agent focused on the actual task.
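One way the filtering could look, as a minimal sketch rather than a tested fix: the function below keeps only diagnostics that point at files in the current diff and caps how many are passed on. The name `filter_lint_output`, its parameters, and the `path:line:col: message` line shape are assumptions for illustration (that shape matches flake8's default output; other tools would need a different pattern). The set of changed files is assumed to come from something like `git diff --name-only`.

```python
import re
from pathlib import PurePosixPath

# Assumed diagnostic shape: "path:line:col: message".
# Paths containing colons (e.g. Windows drive letters) are not handled.
DIAGNOSTIC = re.compile(r"^(?P<path>[^:\s]+):\d+:\d+:\s*.*$")


def filter_lint_output(lint_output, changed_files, max_lines=20):
    """Hypothetical helper: keep only diagnostics for files in the diff."""
    # Normalize so that "./src/a.py" and "src/a.py" compare equal.
    changed = {PurePosixPath(p).as_posix() for p in changed_files}
    relevant = []
    hidden = 0
    for raw in lint_output.splitlines():
        line = raw.strip()
        match = DIAGNOSTIC.match(line)
        if match is None:
            continue  # banners, totals, blank lines: not diagnostics
        path = PurePosixPath(match.group("path")).as_posix()
        if path in changed:
            relevant.append(line)
        else:
            hidden += 1  # diagnostic in a file the task did not touch
    shown = relevant[:max_lines]
    truncated = len(relevant) - len(shown)
    # A one-line summary keeps the agent aware that output was filtered
    # without putting the filtered text itself into the context.
    summary = (
        f"[{len(shown)} shown, {truncated} truncated, "
        f"{hidden} hidden in untouched files]"
    )
    return "\n".join(shown + [summary])
```

The summary line is a deliberate choice: the agent still learns that other warnings exist and how many, so the loss of visibility is explicit, but it does not see text it could mistake for a new task.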
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:03:32.927740+00:00— report_created — created