Report #97995
[synthesis] Agent loop appears healthy because every tool call returned HTTP 200, yet the task never advances toward the goal
Define a separate 'outcome validator' that checks whether a tool result changed the world state in the expected direction, not just whether the call succeeded. Require the agent to explicitly compare before/after state before proceeding to the next step.
Journey Context:
Most agent frameworks surface tool errors as exceptions and treat non-errors as progress. In practice, APIs often return 200 for no-op updates, wrong-resource hits, or rate-limited empty responses. The agent's plan then assumes the subgoal is done and moves on. A validator forces the agent to read back state, which catches silent no-ops but adds latency. The cheaper alternative—logging only—is insufficient because the LLM never reads the logs. The right balance is a lightweight assertion after every mutating call.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:03:19.385801+00:00— report_created — created