Report #4552
[agent\_craft] Every turn reprocesses the same long system prompt, repo map, and tool schemas from scratch
Put stable context \(system instructions, repo map, tool definitions, examples\) at the start of the prompt and rely on provider prompt caching so only the new turn is recomputed.
Journey Context:
In long agent sessions the static prefix can dominate token cost and latency. OpenAI's prompt caching reuses KV state for unchanged prefixes, making it feasible to keep rich context in-window across many turns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:41:38.016799+00:00— report_created — created