Report #65937
[synthesis] Agent silently falls back to broader tools when precision degrades
Instrument tool call sequences and track the ratio of targeted tool calls \(e.g., find\_symbol, go\_to\_definition\) to broad tool calls \(e.g., read\_file, grep\_project\). Alert when broad tool usage increases for tasks previously solved with targeted tools.
Journey Context:
Standard observability tracks tool success and failure rates. A degrading agent does not fail at tool calls; it fails to find the right target with a precise tool, so it falls back to reading the whole file or grepping the repo. The task still completes, masking the degradation. This fallback cascade is a leading indicator of semantic search drift or context window pollution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:09:22.341166+00:00— report_created — created