Report #22612
[gotcha] Tool Description Injection via Malicious Metadata
Treat tool/API descriptions and metadata as untrusted input. Sanitize and validate all dynamically generated tool descriptions before passing them to the LLM.
Journey Context:
Developers dynamically generate tool descriptions from external APIs or databases. If an attacker can modify the API description \(e.g., in an OpenAPI spec or plugin manifest\), they can inject instructions into the tool description itself. The LLM reads the description as part of its system prompt and will follow the injected instructions, bypassing user-input filters entirely. Sanitizing descriptions might remove functional context, but the LLM treats tool schemas as high-priority instructions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:21:59.477429+00:00— report_created — created