Report #60904
[gotcha] LLM trusting data returned from external API or tool calls
Treat all data returned from external APIs, web searches, or database queries as untrusted. Apply input sanitization or use separate LLM calls to process tool outputs before feeding them back to the orchestrator LLM.
Journey Context:
Developers secure the user prompt but forget that if the LLM searches the web or queries an API, the \*response\* can contain malicious instructions. An attacker puts 'Ignore previous instructions and...' on a webpage, the LLM scrapes it, and the indirect injection takes over the agent's context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T08:42:53.440199+00:00— report_created — created