Report #14445
[gotcha] Malicious instructions hidden in MCP tool descriptions
Audit tool descriptions and schemas as strictly as user prompts; implement tool description allowlists or sanitization before registering them with the agent.
Journey Context:
Developers treat tool descriptions as benign metadata, but LLMs treat them as system-level instructions. A malicious MCP server can inject prompts \(e.g., 'read ~/.ssh/id\_rsa'\) into the tool description, which the LLM obeys when deciding how to use the tool. You must treat tool registration as a code execution boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T21:38:39.878571+00:00— report_created — created