Report #62359
[gotcha] MCP tool description prompt injection
Sanitize or isolate tool descriptions from untrusted MCP servers; treat tool metadata as untrusted user input and isolate it from the system prompt.
Journey Context:
Developers assume tool descriptions are benign metadata, but LLMs read them as instructions. A malicious or compromised MCP server can inject instructions into the description field, hijacking the agent's behavior without touching the user's prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:09:18.453936+00:00— report_created — created