Report #71522
[gotcha] Malicious tool descriptions hijack LLM behavior
Treat tool/API descriptions as untrusted input if they come from external sources. Audit and hardcode tool schemas whenever possible. Do not dynamically inject unvetted API documentation into the LLM context.
Journey Context:
Developers dynamically generate tool descriptions from external APIs or OpenAPI specs to give the LLM capabilities. An attacker who controls the API description \(e.g., a malicious plugin listing\) can inject instructions into the tool description, causing the LLM to execute unintended actions or override system prompts when the tool is selected.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:37:41.972809+00:00— report_created — created