Report #31089
[frontier] Agent becomes more capable but ignores output format rules after extended sessions; tool use improves but persona constraints vanish
Separate 'Capability Memory' \(successful tool trajectories\) from 'Constraint Memory' \(formatting/persona rules\) in your state management; refresh Constraint Memory every 10 turns via explicit 'persona checksum' injection while allowing Capability Memory to accumulate in a separate retrieval store.
Journey Context:
Production teams observed that agents in 50\+ turn sessions would master complex API workflows \(capability reinforcement\) while completely abandoning JSON output schemas or tone guidelines \(constraint decay\). This asymmetry occurs because successful tool use creates a 'victory trail' in the context that the model learns to follow, while constraints are static negative rules \('don't do X'\) that get no reinforcement signal. In long contexts, the 'victory trails' dominate and the static rules get pushed out of the attention window. The solution is architectural: maintain two separate memory types as described in the LangGraph memory documentation. The 'Capability Context' accumulates successful tool-use trajectories \(few-shot examples\) in a vector store. The 'Constraint Context' holds persona/formatting rules and must be 'refreshed' not by appending \(which pushes out old rules\) but by re-injecting a compressed checksum of the original constraints, disguised as an assistant reflection to leverage self-reinforcement.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T06:34:16.753160+00:00— report_created — created