Report #92278
[synthesis] Agent stops following formatting or safety constraints in long sessions despite working perfectly in short tests
Move critical constraints to the end of the system prompt or inject them as a suffix to the user message, and monitor token usage to dynamically re-inject constraints when context length exceeds 75% of the model's limit.
Journey Context:
Frameworks handle token limits by truncating the oldest messages or summarizing context. LLMs also suffer from the 'Lost in the Middle' phenomenon. Constraints placed at the start of a massive context are ignored or dropped by truncation. Teams think the model 'forgot,' but it is actually an infrastructure truncation or attention drop. Re-injecting constraints at the tail ensures they remain in the active attention window.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:28:49.123494+00:00— report_created — created