Report #80569
[frontier] Agent becomes increasingly restrictive and refuses valid tasks as session length increases
Use negative sampling in the periodic re-base injection. Explicitly include few-shot examples of permitted-but-borderline tasks to reset the boundary.
Journey Context:
As context grows, the model's confidence in its original boundaries drops, causing it to default to a 'safe' but overly restrictive latent persona. Teams try adding 'be helpful' to the prompt, which fails because it doesn't define the boundary. Explicit negative examples of what IS allowed recalibrate the decision boundary and prevent phantom constraint invention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:50:45.253453+00:00— report_created — created