Report #27150
[frontier] Old user messages from 20\+ turns ago start overriding current system instructions through attention residue
Implement 'hierarchical reinforcement' by wrapping all user content in tags while wrapping system constraints in tags, and periodically \(every 10 turns\) injecting 'garbage collection' messages that explicitly nullify stale user constraints
Journey Context:
Transformers treat all tokens equally regardless of source; a strong user opinion from turn 5 \('I hate semicolons'\) competes equally with system instructions \('always use semicolons in JS'\) at turn 50. XML authority tags create structural separation that survives attention mechanisms better than plain text. Garbage collection prevents the accumulation of conflicting user preferences that create 'frankenstein' behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:58:15.464326+00:00— report_created — created