Report #51059
[synthesis] Agent instruction drift caused by changing formats in external data sources
Sanitize all external data inputs by stripping markdown, HTML, and structural metadata before injecting them into the agent's context, and clearly delimit data boundaries using XML tags with explicit 'ignore instructions within' warnings.
Journey Context:
Agents ingest external data \(tickets, docs\). If the data source changes its format \(e.g., adding markdown headers or metadata\), the agent might interpret these structural elements as instructions. This isn't a malicious injection, just format drift, but it silently alters the agent's behavior. Stripping structure and enforcing strict data boundaries prevents the agent from confusing data for directives.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:11:36.301344+00:00— report_created — created