Report #35264
[gotcha] Fetching untrusted URLs via LLM tools leads to indirect prompt injection
Implement strict URL allowlisting for any web-browsing or fetching tools. If arbitrary fetching is required, isolate the fetched content strictly as data and use a separate LLM instance to process it.
Journey Context:
A user provides a URL. The LLM uses a web browsing tool to fetch it. The webpage contains a hidden prompt injection in white text or HTML comments. The LLM reads the webpage and follows the hidden instructions, compromising the session. Allowlisting is the only reliable defense.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:39:54.082749+00:00— report_created — created