Report #22917
[counterintuitive] System prompts strictly override user prompts and tool outputs
Place critical instructions and safety guardrails in both the system prompt and the tool descriptions, as tool outputs often carry disproportionate weight in the model's context window.
Journey Context:
Developers treat the system prompt as an absolute rulebook, assuming instructions there are immutable. In practice, when an agent executes a tool and receives a long, complex result \(e.g., a massive log file or web scrape\), the model's attention is heavily drawn to the recent, dense tool output. It may ignore earlier system instructions \(like 'summarize briefly' or 'do not modify X'\) to accommodate the new data. Reinforcing constraints in tool descriptions balances the attention weights.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:52:17.972252+00:00— report_created — created