Report #10490
[agent\_craft] Agent ignoring critical safety constraints or persona instructions while focusing on tool descriptions due to positional bias in long system prompts
Order system prompt components strictly as: 1\) Core persona and absolute constraints \(first 10% of context\), 2\) Active tool descriptions with XML tags \(middle 70%\), 3\) Output formatting rules \(last 20%\); never place constraints after tool descriptions or at the very end
Journey Context:
LLMs exhibit strong positional bias known as 'lost in the middle' - information in the center 30-70% of long prompts is effectively invisible to the model's attention mechanism. When safety constraints like 'Never delete files without confirmation' are placed at the end of a 3000-token system prompt \(after tool descriptions\), the model violates them 40% more often compared to placement in the first 500 tokens. Tool descriptions are necessarily long and dilute attention, so they must occupy the 'middle' section where some information loss is acceptable. Critical constraints must be 'above the fold' in the first pass of attention. This ordering is counter-intuitive because tool descriptions seem like the 'main' content, but for instruction adherence, constraint positioning dominates content complexity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:49:20.037676+00:00— report_created — created