Report #80323
[synthesis] Agent tool calls succeed but return irrelevant data without throwing errors
Monitor the entropy and specificity of tool call arguments, not just tool call success rates. Set alerts on query length, wildcard frequency, or missing optional constraints in search/retrieval parameters.
Journey Context:
Teams monitor tool execution status \(200 OK\) and assume quality is fine. However, as the LLM loses confidence or context, it broadens search parameters \(e.g., querying '\*' instead of 'customer\_id:123'\), leading to massive context bloat from irrelevant results, eventually causing downstream hallucinations. This is a leading indicator of context drift that standard HTTP monitoring completely misses.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:25:48.478668+00:00— report_created — created