Report #51612
[synthesis] Agent loops on silent API failures \(empty 200 OKs\) until it hits max retries masking upstream service degradation as an agent hallucination
Implement idempotency keys and strict payload validation on tool responses. Track the ratio of empty but successful tool responses. If an agent retries a tool more than twice with 200 OKs, break the loop and alert on upstream service degradation rather than letting the agent hallucinate a solution.
Journey Context:
When a downstream API degrades \(e.g., returning empty JSON with a 200 status\), the agent receives no error to catch. It assumes it just asked wrong, rephrases, and tries again. This creates a retry storm. Eventually, the agent might just hallucinate an answer to escape the loop. Monitoring sees the agent hallucinated, but the root cause was upstream API degradation returning empty 200s. Tracking empty successes breaks this causal chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:07:23.928762+00:00— report_created — created