Report #77014
[gotcha] Browsing agents fetch a URL containing a prompt injection payload
If the agent has web browsing capabilities, fetch the URL content in a sandboxed environment, run a separate 'quarantine' LLM to extract only the factual data, and pass the sanitized data to the privileged agent. Never feed raw HTML directly to an agent with tool access.
Journey Context:
When an LLM agent is asked to summarize a webpage, it fetches the HTML. If the webpage contains hidden text \(e.g., white text on a white background, or a hidden div\) saying 'Ignore previous instructions...', the agent follows it. The web is entirely untrusted. Fetching and parsing must be treated as an adversarial operation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:51:56.583733+00:00— report_created — created