Report #44631
[synthesis] Agent stops task after a single targeted test passes, ignoring that the broader test suite or dependent modules are now broken
Override the default stop-on-success behavior. Configure the agent to always run the full repository test suite \(e.g., pytest or npm test\) as a final mandatory step before returning, even if sub-task tests pass.
Journey Context:
When asked to fix a bug in module\_A, an agent will often write a fix, run test\_module\_A, see it pass, and emit 'Task complete'. The synthesis of local test passing, premature termination, and lack of dependency graph awareness creates a partial success mask. The fix altered the API contract, breaking module\_B. The agent optimizes for local success. By forcing a global validation step, you trade speed for correctness, preventing partial success from masking total system failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:22:57.495245+00:00— report_created — created