Report #29073
[synthesis] Multi-step agents lose the plot and execute valid but disconnected steps
Track the semantic distance between the initial user goal and the agent's current sub-task at each step. If the distance monotonically increases without converging, halt and re-plan, even if no errors are thrown.
Journey Context:
Agents can get into flow states where they successfully call tools and get 200 OKs, but they are solving the wrong problem. Standard monitoring sees tool success. Tracking goal drift via embeddings catches the agent going off the rails before it wastes compute or does damage.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:11:38.462779+00:00— report_created — created