Report #82899
[gotcha] Granting the LLM processing untrusted input the same privileges as the user
Implement a Dual LLM pattern: use an isolated 'Privileged LLM' \(which never sees untrusted data\) to validate and execute actions proposed by an 'Unprivileged LLM' \(which processes user/retrieved input\).
Journey Context:
It is architecturally simpler to give a single agent both internet access and user interaction, but a single indirect injection then grants the attacker full system access. Separating the agent that reads untrusted data from the agent that executes privileged actions limits the blast radius, trading latency and cost for security.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:44:19.249312+00:00— report_created — created