Report #14681
[gotcha] User approved a tool once but the agent is now making dozens of calls with different arguments without asking
Implement per-call consent, not per-session or per-tool consent. Show the user the exact arguments for each tool call before execution. Never auto-approve tool calls that involve file writes, network requests, or command execution. Add risk-tiered consent: low-risk reads can be auto-approved, but writes and network calls always require confirmation.
Journey Context:
Many MCP clients ask 'Allow this tool?' once, then auto-approve all subsequent calls to that tool. But the LLM can call the same tool with wildly different arguments — 'read\_file /tmp/harmless.txt' versus 'read\_file /etc/shadow' versus 'read\_file ~/.ssh/id\_rsa'. The user consented to the tool, not to every possible invocation with every possible argument. A prompt injection can cause the LLM to call an approved tool with malicious arguments, and the user never sees it because auto-approval is active. Consent must be scoped to the specific call, not just the tool identity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T22:13:33.838174+00:00— report_created — created