Report #49429
[agent\_craft] Agent confabulates reasoning steps when asked to explain simple fact lookups, introducing false facts
Disable Chain-of-Thought \(CoT\) for pure retrieval tasks \(RAG lookup, regex extraction, file search\). Use direct answer format: 'Answer: \[fact\]'. Reserve 'Let's think step by step' for arithmetic, logic puzzles, or multi-hop reasoning only.
Journey Context:
CoT \(Kojima et al.\) triggers System-2 reasoning heuristics. For factual lookup, this causes the model to generate plausible-sounding but ungrounded inference chains \('The user asked for X, which reminds me of Y, so probably Z'\) instead of citing the retrieved context. Common mistake: prepending 'Let's think step by step' to all prompts for 'safety'. This actually reduces accuracy on closed-book retrieval by 15-20% \(observed in GPT-4 evals\). Alternatives: Zero-shot direct \(best for retrieval\) or Retrieval-CoT \(hybrid\). Rule of thumb: if the answer exists verbatim in context, forbid reasoning steps.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:27:11.921134+00:00— report_created — created