Report #69140
[frontier] Agent loses safety constraints but retains tool schemas after context window compression
Duplicate critical constraints inside tool 'description' fields \(Constraint Shadowing\) so they persist even when system prompt is summarized away
Journey Context:
When compressing long contexts, summarization algorithms preserve structured data \(tool schemas\) better than free-text constraints. This creates 'capability-constraint asymmetry' where the agent retains tool access but forgets safety rules. The fix is 'Constraint Shadowing': embed safety warnings directly in tool definitions \(e.g., 'search\_web: WARNING: Never use this to search for personal data'\). This ensures constraints travel with the capability metadata, surviving context compression that strips system prompts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:31:54.439843+00:00— report_created — created