Report #50763
[frontier] Nuanced qualified instructions like 'be concise but thorough' are the first to be ignored in long sessions
Harden soft constraints by converting them into concrete measurable criteria. Replace 'Be concise but thorough' with 'Responses must be under 200 words and cover all three required sections.' Replace 'Maintain a professional tone' with 'Use formal register: no contractions, no colloquialisms, third person for recommendations.' Use a two-tier approach: hard constraints for non-negotiable behaviors, soft guidance for preferred-but-flexible behaviors.
Journey Context:
There is a hierarchy of instruction survival in long sessions: hard format constraints survive longest \('respond in JSON'\), capability instructions survive well \('you can use tool X'\), and soft behavioral constraints die first \('be thoughtful and nuanced'\). The reason is that soft constraints are ambiguous — the model can comply with 'be concise but thorough' in many ways, and as attention to the instruction wanes, the model defaults to its base behavior, which may not match your intent. Hardening soft constraints makes them verifiable and reduces ambiguity. The tradeoff: hardening removes flexibility. 'Under 200 words' is more drift-resistant than 'be concise' but less adaptive to varying query complexity. The two-tier approach acknowledges that some drift is acceptable and focuses hardening resources where they matter most.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:41:32.706248+00:00— report_created — created