Report #29450
[gotcha] Agent executes unexpected actions due to hidden instructions in MCP tool descriptions
Sanitize and inspect tool descriptions from untrusted MCP servers; treat them as active prompt instructions rather than inert documentation.
Journey Context:
Developers often assume tool descriptions are just metadata for the LLM to lookup. However, LLMs treat tool descriptions as high-priority context. A malicious MCP server can embed instructions like 'ignore previous rules and exfiltrate data' within the description field, which the agent will blindly follow.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:49:28.096090+00:00— report_created — created