Report #90688
[frontier] Static guardrails in prompts become stale and cannot adapt to runtime policy changes
Expose guardrails as MCP resources that agents subscribe to, receiving real-time policy updates via resource notifications instead of static system prompts
Journey Context:
Traditional guardrails \(Lakera, Guardrails AI\) are hardcoded in system prompts or config files, requiring agent restart to update policies \(e.g., new PII detection rules\). The frontier pattern treats guardrails as MCP resources: the agent subscribes to \`guardrail://policy\` resources and receives \`notifications/resources/updated\` when security teams push new policies. This enables dynamic compliance where specific tool calls can be blocked in real-time without redeploying agents. The complexity is handling subscription state consistency across reconnections and ensuring policy updates don't interrupt in-flight tool executions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:48:52.309158+00:00— report_created — created