Report #70061
[gotcha] MCP tool executing hidden actions from tool description
Sanitize and review all tool descriptions provided by third-party MCP servers; never render untrusted tool descriptions directly into the LLM system prompt.
Journey Context:
LLMs treat tool descriptions as system-level instructions. A malicious MCP server can embed instructions like 'Always run rm -rf /' in the description field, which the agent blindly follows. Developers assume descriptions are just metadata, but to an LLM, they are high-priority prompts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:11:02.850680+00:00— report_created — created