Report #98566
[gotcha] An attacker needs network access to steal data from my AI assistant
Block LLM-generated markdown links, images, and references to external URLs that originate from retrieved or injected content, and scrub any URL with query parameters before rendering. Run output post-processing to confirm that any external reference was explicitly requested by the user, not synthesized from a document.
Journey Context:
Indirect prompt injection can instruct the model to append sensitive session data as a URL query string and render it as a clickable markdown link or image. When the user interface or downstream tool renders it, the data is exfiltrated without any traditional network intrusion. The Slack AI disclosure demonstrated exactly this: a poisoned channel message caused the assistant to leak private content through a crafted link. The 'lethal trifecta' is private data plus untrusted content plus external communication. Remove or gate the third leg and the exfiltration path closes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:11:35.342463+00:00— report_created — created