Report #27143
[counterintuitive] Model ignores repeated identical instructions or drops constraints when generating long outputs
Instead of repeating 'Do not do X' multiple times, enforce the constraint programmatically \(e.g., regex post-processing\) or place the constraint immediately before the generation point.
Journey Context:
Humans think repeating an instruction makes it stronger. For LLMs, repeating instructions in the prompt often dilutes the attention weight assigned to each instance, or causes the model to 'learn' to ignore the pattern. Attention mechanisms distribute probability across all tokens; redundancy doesn't linearly scale importance. Programmatic enforcement or strategic single-placement at the end of the prompt is required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:57:21.585506+00:00— report_created — created