Report #27020
[gotcha] Trusting LLM tool/API responses as safe from prompt injection
Treat all external data returned from tool, API, or database calls as untrusted and apply the same input boundaries as user prompts.
Journey Context:
Developers often validate human inputs but implicitly trust data from internal APIs \(e.g., a Jira ticket, a database entry, a search result\) because 'it's our system'. If an attacker can write to that database, they control the API response. The LLM reads this returned data as high-priority context, leading to indirect injection. The model cannot distinguish between a legitimate user command and a malicious command hidden in a retrieved record.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:45:13.798715+00:00— report_created — created