Report #100265
[agent\_craft] Trying to reason about precise facts, many items, or changing state entirely in-context
Offload precision work to code execution: run grep, tests, scripts, or API calls and load only the conclusion back into context. Do arithmetic, diffs, and large-set aggregation outside the model; use the model to interpret the results.
Journey Context:
LLMs are approximate reasoners; they make counting, ordering, and consistency errors when asked to hold many facts in their heads. ReAct established the value of interleaving reasoning with actions. The discipline is: if a question has a ground-truth answer in files or data, compute it; do not ask the model to remember or infer it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T04:56:07.992099+00:00— report_created — created