Report #64070
[agent_craft] Agent either chains tools reactively without planning (Action -> Observation -> Action) or reasons extensively without ever calling tools, missing the window to execute
Enforce the ReAct pattern: interleave Thought, Action, and Observation. Specifically, require the agent to output a Thought that analyzes the current state and plans the next step, followed immediately by an Action (tool call). Do not allow consecutive Thought turns without an Action in between, and do not allow a tool call without a preceding Thought. After the first step, that Thought must reference the Observation returned by the previous step.
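The ordering rule can be checked mechanically on a recorded trace. The following is a minimal sketch, assuming the trace is available as a list of (kind, content) pairs; the `Step` alias, the `NEXT_KIND` table, and `find_violations` are hypothetical names chosen for illustration, not part of any agent framework. It checks ordering only and does not verify that a Thought actually references the previous Observation.

```python
from typing import List, Tuple

# Hypothetical trace format: (kind, content), where kind is
# "thought", "action", or "observation".
Step = Tuple[str, str]

# The only allowed successor of each step kind under strict alternation.
NEXT_KIND = {"thought": "action", "action": "observation", "observation": "thought"}


def find_violations(trace: List[Step]) -> List[str]:
    """Return one message for every step that breaks the Thought -> Action -> Observation cycle."""
    violations = []
    expected = "thought"  # a trace must open with a Thought
    for index, (kind, _content) in enumerate(trace):
        if kind not in NEXT_KIND:
            violations.append(f"step {index}: unknown kind {kind!r}")
            continue
        if kind != expected:
            violations.append(f"step {index}: expected {expected}, got {kind}")
        # Resynchronize on the step actually seen so one slip is reported once.
        expected = NEXT_KIND[kind]
    return violations
```

Two consecutive Thought steps (analysis paralysis) and an Action with no preceding Thought (reactive chaining) each produce a violation. A trace that ends after an Action or a Thought is accepted, since it may simply be incomplete.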
Journey Context:
Purely reactive agents (Action-only) fail on multi-step tasks because they don't maintain a plan across context window limits; purely reasoning agents (Thought-only) hallucinate tool outputs. The ReAct pattern (Reasoning + Acting) grounds the reasoning in tool observations. However, without strict formatting, models either generate long chains of reasoning without ever invoking a tool (analysis paralysis) or invoke tools without updating their internal state based on the previous result (repetitive loops). By mandating the strict alternation Thought -> Action -> Observation -> Thought, you force the model to synchronize its internal plan with the external environment state. Common mistakes include allowing the model to emit multiple Thought blocks before an Action, or omitting the Observation wrapper on tool returns, either of which breaks the pattern.
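One way to avoid both mistakes is to enforce the alternation in the driver loop rather than trusting the model's formatting. The sketch below is illustrative only and makes several assumptions: `next_step` stands in for a model call that returns a dict with `thought`, `tool`, and `input` keys, `tools` maps tool names to plain Python callables, and `"finish"` is a hypothetical sentinel for ending the run. None of these names come from a real library.

```python
from typing import Callable, Dict, List


def run_react(
    next_step: Callable[[List[str]], Dict[str, str]],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 5,
) -> List[str]:
    """Drive a Thought -> Action -> Observation loop and return the transcript."""
    transcript: List[str] = []
    for _ in range(max_steps):
        # Each model turn must yield exactly one Thought and one Action.
        step = next_step(transcript)
        thought = step.get("thought", "").strip()
        if not thought:
            raise ValueError("Action proposed without a preceding Thought")
        transcript.append(f"Thought: {thought}")

        tool_name = step.get("tool", "")
        if tool_name == "finish":  # hypothetical sentinel: stop after a final Thought
            break
        if tool_name not in tools:
            raise ValueError(f"Thought was not followed by a known Action: {tool_name!r}")

        tool_input = step.get("input", "")
        transcript.append(f"Action: {tool_name}[{tool_input}]")
        # Always wrap the raw tool return so the model sees it as an Observation.
        transcript.append(f"Observation: {tools[tool_name](tool_input)}")
    return transcript
```

Because the loop appends the Observation itself, a tool return can never reach the transcript unwrapped, and because each turn carries a single Thought and a single Action, the model cannot stack several Thoughts before acting. Raising on a violation is one design choice; re-prompting the model with the error would be another.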
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:01:38.228270+00:00— report_created — created