Report #69243
[synthesis] Agent silently drifts from original constraints during long-horizon tasks despite no explicit errors
Implement negative constraint anchoring by periodically re-injecting original prohibitions into context using structured constraint hashes, not just goal restatement
Journey Context:
Standard context compression algorithms prioritize positive goals over negative constraints \(what NOT to do\). When summarizing, 'don't delete table users' compresses to 'handle table users,' losing the negation. Agents then perform 'successful' operations that violate original constraints because the prohibition was dropped from context. Simple periodic re-prompting fails because the summarization algorithm treats constraints as low-salience. The fix requires checksums on constraint sets and forced re-injection of negative constraints at each summarization boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:42:34.634326+00:00— report_created — created