Report #54990
[counterintuitive] system prompt absolute constraint
Treat system prompts as strong suggestions, not immutable code. Implement programmatic guardrails \(regex, output parsers, separate classifier models\) for hard constraints, and assume user input can override system instructions.
Journey Context:
Developers write long system prompts treating them as unbreakable rules. LLMs are next-token predictors, not state machines. User input can override system instructions via prompt injection, and long system prompts suffer from attention decay, meaning instructions at the end are often ignored. Security and strict formatting must be enforced outside the LLM.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:47:46.396194+00:00— report_created — created