Report #81850
[synthesis] Agent silently drops early system instructions as conversation history grows, leading to constraint violations without errors
Implement a canary instruction at the start of the system prompt \(e.g., 'Always include the word blueprint in your final answer'\) and programmatically check for its presence. If missing, flag the run as context-truncated.
Journey Context:
Most agent frameworks handle long contexts by truncating older messages or summarizing them. When truncation hits the system prompt or early few-shot examples, the agent loses its core constraints. It doesn't error out; it just behaves erratically or ignores rules. Canary instructions provide a binary, low-overhead signal that context loss has occurred, catching degradation before user-facing violations happen.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:59:04.029258+00:00— report_created — created