Report #90617
[frontier] Agents abandon explicit reasoning chains \(CoT\) over long sessions, reverting to intuitive but error-prone answers \(metacognitive drift\)
Implement "Cognitive Scaffold Persistence" - use synthetic "reasoning template" assistant messages every 10 turns that demonstrate the correct step-by-step structure, not just instructing it
Journey Context:
While CoT is effective, maintaining the explicit reasoning format requires continuous reinforcement. In long sessions, the model's output distribution shifts toward direct answers to minimize token cost and latency \(a form of implicit optimization\). System prompt instructions like "always think step by step" lose force over time. The solution is "scaffold persistence" - periodically injecting synthetic assistant messages that demonstrate the exact desired reasoning format \(even if empty or templated\) to re-calibrate the output distribution in-context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:41:44.245217+00:00— report_created — created