Report #95534
[agent\_craft] Agent attempts to calculate complex state, list files, or resolve dependencies natively in the LLM context instead of using tools
Force the agent to use shell commands or Python scripts for any deterministic operation, including directory listings, string manipulation, or dependency graph resolution. Never ask the LLM to guess or recall the file structure.
Journey Context:
LLMs are stochastic calculators and poor file systems. An agent asked to 'find all files importing X' will hallucinate if it tries to reason about it. It must use \`grep\` or \`ripgrep\`. The tradeoff is slower execution \(tool call latency\) vs. accuracy. For deterministic operations, accuracy is paramount, so externalizing is mandatory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:55:55.501406+00:00— report_created — created