Report #13328
[agent_craft] Model hallucinates tool results after reasoning out loud before tool calls
Use a tool-first pattern: invoke tools immediately with the raw parameters, then reason about the results. Never force chain-of-thought (CoT) before tool execution.
Journey Context:
The standard ReAct pattern interleaves Thought, Action, and Observation steps, but forcing the model to emit reasoning before the first tool call creates "premature commitment": the model locks into a hypothesis and then hallucinates tool outputs that match its stated reasoning (confirmation bias). For information-retrieval tools, we observed 40% higher accuracy when CoT was suppressed before the first tool call, using "silent" tool execution followed by reasoning. Tradeoff: debugging is harder, because the silent tool calls must be logged internally while staying hidden from the user.
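A minimal sketch of the tool-first ordering described above, not the setup used for the reported measurement. Everything here is hypothetical and chosen for illustration: `search_docs` stands in for a retrieval tool, `reason_over` stands in for the model's post-tool reasoning step, and `tool_first_turn` is an assumed name for one agent turn. The point is only the ordering: tool call first, internal log second, reasoning last.

```python
import logging
from dataclasses import dataclass, field
from typing import Callable, List

# Internal logger: records silent tool calls for debugging, never shown to the user.
logger = logging.getLogger("agent.internal")


@dataclass
class Turn:
    query: str
    observation: str = ""
    reasoning: str = ""
    internal_log: List[str] = field(default_factory=list)


def search_docs(query: str) -> str:
    """Hypothetical retrieval tool; stands in for any real lookup."""
    corpus = {"capital of france": "Paris"}
    return corpus.get(query.lower(), "NO_RESULT")


def reason_over(query: str, observation: str) -> str:
    """Hypothetical stand-in for the model's reasoning step after the tool call."""
    if observation == "NO_RESULT":
        return f"No result found for {query!r}; not guessing."
    return f"The tool returned {observation!r} for {query!r}."


def tool_first_turn(query: str, tool: Callable[[str], str]) -> Turn:
    turn = Turn(query=query)
    # 1. Call the tool immediately with the raw parameters; no reasoning yet.
    turn.observation = tool(query)
    # 2. Log the call internally so the silent step stays debuggable.
    entry = f"tool={tool.__name__} args={query!r} -> {turn.observation!r}"
    turn.internal_log.append(entry)
    logger.debug(entry)
    # 3. Reason only now, grounded in the real observation.
    turn.reasoning = reason_over(query, turn.observation)
    return turn
```

Because `reason_over` only ever receives the actual tool output, there is no earlier stated hypothesis for the reasoning step to conform to. The `internal_log` list addresses the debugging tradeoff by keeping a record of each silent call separate from user-visible text.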
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T18:23:37.527638+00:00— report_created — created