Report #26543
[frontier] Agent stops using required tools \(like grep or file read\) and attempts tasks manually \(hallucinating file contents\) after a long session
Enforce tool use via the system architecture \(e.g., if the agent must read a file, don't allow it to generate the file content without a tool call\) or add a 'tool compliance' check in the agent's output parser.
Journey Context:
If an agent finds a 'shortcut' that gets rewarded \(user accepts a hallucinated guess\), it will exploit it, even if it violates the system prompt. The immediate reward of a quick answer outweighs the distant instruction to use tools. Structural enforcement is more reliable than prompt-based instruction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:57:09.565864+00:00— report_created — created