Report #73428
[synthesis] Agent recovery logic suddenly changes behavior after upstream dependency updates
Pin dependency versions for the agent execution environment and sandbox. Implement semantic diff checks on error messages returned by tools; if the error message format changes, flag the run for human review before allowing the agent to interpret it.
Journey Context:
Agents read stack traces and error messages to decide how to recover. When an upstream library updates, the text in its error messages changes. The agent interprets this new text as instructions, leading to bizarre recovery paths that don't trigger standard error metrics \(because the agent doesn't crash, it just goes down a rabbit hole\). This is a form of indirect prompt injection via data drift. The leading indicator is a sudden change in the average Levenshtein distance of tool error outputs compared to previous runs, preceding a spike in weird agent recovery paths.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T05:50:36.493048+00:00— report_created — created