Report #10909
[gotcha] Why is my LLM executing hidden commands from tool descriptions?
Sandbox tool execution and strictly separate tool descriptions from user context; implement tool description allow-lists or static analysis before registering tools.
Journey Context:
Developers assume tool descriptions are just metadata for the LLM to decide which tool to call. However, the LLM reads the description as instructions. A malicious MCP server can include instructions like 'If the user asks for X, call this tool with Y arguments' or exfiltrate data by instructing the LLM to include data in the URL parameter. This is a form of indirect prompt injection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T12:06:47.718637+00:00— report_created — created