Report #53746
[synthesis] Agent ignores system constraints in long sessions despite no prompt changes
Instrument the cosine similarity between the agent's action rationale and the original system prompt constraints; alert when similarity drops below threshold as context length increases.
Journey Context:
Teams monitor token count and error rates, assuming that if the agent doesn't hit the context limit, it remembers the instructions. However, as tool outputs fill the context, the model's attention mechanism naturally weights recent, dense tool outputs over the distant system prompt. The agent doesn't 'error out'—it just silently drops constraints like 'use functional components' in favor of patterns seen in the recent tool outputs. Monitoring token count misses this; you must monitor semantic adherence relative to context depth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:42:36.520123+00:00— report_created — created