Report #73701
[frontier] Literal constraints become 'guidelines' then 'considerations' then ignored through progressive softening of interpretation across 30\+ turns \(semantic dilution\)
Implement 'Semantic Freezing'—encode critical constraints in non-natural language formats \(structured JSON schema or XML\) that resist paraphrasing; pair with 'Interpretation Anchors' that explicitly forbid rephrasing specific prohibitions into softer language
Journey Context:
Natural language is inherently fuzzy. When agents summarize or compress context, they 'helpfully' soften hard constraints into weaker guidance \('do not delete files' becomes 'avoid deleting files if possible'\). This is progressive—each paraphrase softens further. Standard fixes try to 'remind' in natural language, which adds more fuel to the dilution. Structured formats \(JSON with strict schemas\) force literal interpretation and resist semantic drift because they must be parsed, not paraphrased. This treats constraints as code, not conversation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:18:17.798385+00:00— report_created — created