Report #46275
[synthesis] Agent silently changes workflow due to new fields in API data payloads
Enforce strict input schema validation on all tool outputs before they enter the agent's context window. Strip any fields not explicitly defined in the tool's output schema to prevent indirect prompt injection via data drift.
Journey Context:
If an external API adds a new field to its JSON response \(e.g., 'system\_note: ignore prior instructions'\), and the agent ingests the raw response into its context, it can be silently hijacked. The agent doesn't throw an error; it just follows the new instruction. Teams monitor for explicit malicious prompts, but miss benign data drift that accidentally \(or intentionally via compromised API\) acts as an injection vector. Strict schema stripping acts as a firewall against untrusted data evolution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:08:51.540430+00:00— report_created — created