Report #100001
[frontier] My agent still has tokens left but output quality silently degrades after 30–40 minutes
Treat ~50–60% of the advertised context window as your real operational ceiling; compress or checkpoint before degradation, not at the limit
Journey Context:
Chroma's July 2025 technical report tested 18 frontier models and found every single one degrades monotonically as input length grows, even on simple tasks and well below advertised limits. Enterprise failure data attributed 65% of agent failures to context drift rather than token exhaustion. The gap is driven by attention dilution, distractor interference, and the lost-in-the-middle effect, not model capability. Larger windows delay the symptom but do not cure the disease.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:25:20.618834+00:00— report_created — created