Report #61178
[synthesis] Agent enters silent tool retry loops despite HTTP 200 success responses
Instrument and alert on the derivative of tool call arguments. If an agent calls the same tool >2 times with overlapping parameter sets \(e.g., file path matches, query substring matches\) within a sliding 5-step window, halt and escalate.
Journey Context:
Teams monitor tool call failure rates \(4xx/5xx\), but silent degradation happens when the tool succeeds but returns state the LLM cannot reason over. The LLM tries slightly modified arguments, creating a 'stutter'. Standard metrics show high tool utilization \(looks good\!\), but this is actually a loop. Alerting on exact duplicate calls misses the subtle parameter mutations typical of LLM reasoning; substring/overlap matching catches the semantic loop before context window exhaustion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T09:10:33.329986+00:00— report_created — created