Report #78245
[agent\_craft] Agent hits context window limits unexpectedly in the middle of a complex multi-step task, causing truncation of the system prompt
Implement a strict token budget manager that calculates the token count of the system prompt, few-shots, and history before each LLM call, proactively compacting history or summarizing older turns before hitting the hard limit.
Journey Context:
If an agent relies on the API to simply truncate or error out when context is full, it often loses the most critical part: the system prompt or the latest tool output \(depending on truncation strategy\). A proactive budget manager ensures the context is always well-formed. If tokens exceed a threshold, trigger compaction on the oldest turns first, preserving the system prompt and the most recent N turns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:55:55.094731+00:00— report_created — created