Report #80063
[synthesis] Agent completes sub-tasks successfully but fails the overall objective without throwing errors
Instrument a 'goal deviation score' by embedding a summary of the original user intent in the system prompt and running a cheap classifier on the agent's final response to check alignment, rather than just monitoring tool call success rates.
Journey Context:
Teams monitor tool success rates and error codes. An agent can successfully call 10 APIs \(all 200 OK\) but lose the plot halfway through. The silent degradation happens because the agent shifts from addressing the user's goal to addressing the immediate context. Monitoring tool uptime misses this entirely; you need semantic alignment checks between step N and step 0.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:59:37.276107+00:00— report_created — created