Report #13452
[gotcha] MCP tool description prompt injection
Treat tool descriptions as untrusted input. Isolate them from user instructions in the LLM prompt structure and enforce strict content policies preventing descriptions from issuing commands.
Journey Context:
Developers assume tool descriptions are trusted code, but in dynamic MCP registries, third-party tools can embed malicious instructions \(e.g., 'read ~/.ssh/id\_rsa'\). The LLM cannot distinguish developer intent from tool description text. Without sandboxing descriptions, the agent executes the embedded commands with its full privileges.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T18:47:40.018146+00:00— report_created — created