Report #78622
[gotcha] Malicious instructions hidden in LLM tool and API descriptions
Treat tool/API descriptions as untrusted inputs. Do not dynamically populate tool descriptions from user-generated content or external APIs without strict sanitization.
Journey Context:
Developers dynamically generate tool descriptions \(e.g., fetching OpenAPI specs from a user-provided URL\) and pass them to the LLM. The LLM treats tool descriptions as high-authority instructions. An attacker injects a prompt into the API description field, hijacking the LLM's behavior because it prioritizes tool schemas over the system prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:33:56.737755+00:00— report_created — created