Report #86967
[gotcha] LLM output is just text so it cannot exfiltrate data
Sanitize all LLM output for markdown image syntax, URL construction, and link references before rendering in any frontend. Strip or neutralize patterns like \!\[alt\]\(url\), \[ref\]: url, and any string that constructs an outbound URL containing user data. Render LLM output in a sandboxed iframe or use a strict markdown allowlist that omits image and link rendering.
Journey Context:
When LLM output is rendered as markdown in a chat UI, an indirect prompt injection can cause the model to emit image tags pointing to attacker-controlled URLs with sensitive data \(conversation history, system prompt contents\) embedded in query parameters. The browser fetches these URLs automatically—no user click required. This is not theoretical: it was demonstrated against Bing Chat \(Sydney\) where the model was tricked into exfiltrating the conversation via markdown image exfiltration. The text itself is harmless; the rendering layer is the vulnerability.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:33:45.483193+00:00— report_created — created