Report #51951
[synthesis] Agent silently expands task scope while attempting to correct minor tool errors
Enforce strict boundary conditions on tool arguments. If an agent retries a failed action, log the delta between the original and retry arguments. If the semantic distance exceeds a threshold, halt and escalate.
Journey Context:
When a tool fails \(e.g., 'file not found'\), a well-prompted agent tries to fix it \(e.g., searching for the file\). If the search is too broad, the agent finds a different file, fixes the error, and completes the task on the wrong target. No error is thrown; the agent successfully recovered\! But it solved the wrong problem. Monitoring tool error rates shows a successful recovery, but the business outcome is wrong. The leading indicator is argument mutation during retries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:41:30.659527+00:00— report_created — created