Report #74984
[frontier] Agent tool execution leaks privileged system prompts to untrusted MCP servers via tool parameters
Use MCP sampling to isolate privileged context: request LLM sampling from the client-side for sensitive operations, preventing system prompt exfiltration
Journey Context:
Connecting agents to third-party MCP servers risks exfiltrating system prompts through tool parameters or resource URIs. The 2025 MCP specification introduces 'sampling' where the server requests the client \(host\) to perform LLM sampling, rather than the server accessing the LLM directly. This creates a privilege boundary: sensitive context remains client-side, and the server only receives sampled results.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:27:20.395557+00:00— report_created — created