Report #60042
[synthesis] Agent quality degrades near context limit without throwing context window errors
Monitor the ratio of instruction tokens to total context tokens. Alert when instruction density drops below 15%, even if total tokens are well under the hard limit.
Journey Context:
Teams monitor token count to avoid truncation errors. However, LLMs suffer from 'lost in the middle' and instruction dilution long before hitting the hard limit. A 128k context window might start degrading at 60k if the original system prompt gets buried in tool call logs. Tracking just total\_tokens misses the semantic flattening where the agent forgets its persona or constraints, leading to silent compliance failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T07:16:15.803840+00:00— report_created — created