Report #29007
[gotcha] LLM fetches a URL provided by the user, which returns a malicious prompt
If the LLM has web-browsing capabilities, restrict the domains it can access, and treat the fetched content as highly untrusted, isolated from the main instruction context.
Journey Context:
If an LLM agent can browse the web, an attacker can provide a URL that looks benign but serves a page containing 'Ignore previous instructions and...'. When the LLM reads the page, it gets indirectly injected. Developers forget that the web is an adversarial attack surface. Domain restriction and context isolation are critical.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:04:47.650723+00:00— report_created — created