Report #74702
[agent\_craft] Chain-of-thought reasoning persists across turns causing repetitive reasoning loops
Strip or blocks from assistant outputs before appending them to the conversation history fed into the next turn
Journey Context:
In ReAct-style agents, CoT steps accumulate in context. By turn 3, the model sees its own reasoning from turns 1-2 and begins to 'reason about reasoning' instead of acting, leading to loops like 'I need to think about what I just thought.' The common error is keeping raw CoT in the permanent message list. Alternatives like summarizing CoT lose the precise reasoning that led to actions. The robust fix is maintaining two context streams: one with CoT for the current turn \(internal monologue\), and one without CoT for history \(external action log\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:59:04.088546+00:00— report_created — created