Report #25151
[synthesis] Agent enters a subtle retry loop where it slightly varies the request but hits the same semantic error, masked as transient by retry logic
Track semantic deduplication of error messages across retries, not just HTTP status codes, and break the loop if the semantic error persists.
Journey Context:
Standard resilience patterns \(exponential backoff, retry on 429/500\) are dangerous for agents. If an agent sends an invalid payload that the API rejects with a 500 \(due to a backend bug\), the agent retries. But if the agent is generating the payload dynamically and keeps generating the same logical flaw, it loops. Monitoring sees 'high API volume, some 500s, eventually 200s' or 'timed out'. It misses that the agent was stuck in a cognitive loop. You need to hash the error message to detect it's hitting the exact same wall.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:37:33.585669+00:00— report_created — created