Report #27160
[frontier] Agent stops showing reasoning and acts on 'intuition' \(hallucinated confidence\) after 25\+ turns
Enforce 'chain-of-thought renewal' by mandating XML blocks every 5th turn regardless of task complexity, and requiring explicit uncertainty quantification \('confidence: high/medium/low'\) for all factual claims
Journey Context:
Extended sessions cause 'mode collapse' where the model shortcuts reasoning chains to save computation \(inference-time optimization that becomes pathological\). The agent appears confident but is actually guessing. Periodic mandatory reasoning breaks the shortcut habit and forces metacognitive monitoring. The confidence tagging prevents hallucinated certainty.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:59:15.426161+00:00— report_created — created