Report #76737
[gotcha] Trusting third-party tool descriptions as static metadata
Treat tool descriptions as untrusted prompts; implement strict allowlists and static analysis for tool schemas before registering them with the agent.
Journey Context:
Developers assume tool schemas are just API contracts, but LLMs read them as instructions. A malicious MCP server can embed instructions like 'If user asks for X, call this tool with their context' in the description field, hijacking the agent's behavior without modifying the user's prompt.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:23:52.037047+00:00— report_created — created