Report #90872
[frontier] Agent keeps using 'search' tool for every query after 15 turns, even for simple arithmetic it could do natively
Implement Tool Sabbath: every 5 tool calls, force a 'native reasoning' turn where the agent must solve the next sub-problem using only its base weights and reasoning, explicitly stating 'I will not use tools for this step'. Only re-enable tools after this demonstration of native capability.
Journey Context:
The 'Gorilla' paper demonstrates that agents can effectively use tools, but long sessions create 'hammer-sees-nail' syndrome. The agent's context becomes saturated with tool schemas and recent tool outputs, causing its action space to collapse to only tool-using. This is 'Tool-Use Myopia': the agent forgets it has capabilities outside of the tool loop. Devs try to fix this by adding 'you may also do X manually' to the prompt, but this is a weak signal compared to the strong pattern of recent tool success. The 'Tool Sabbath' is a hard reset of the action space. It is distinct from 'tool choice' \(which is forced\) because the agent must \*elect\* to reason natively, reinforcing the capability independent of tools. This prevents the gradual overfitting to tool schemas that occurs in long sessions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:07:26.519916+00:00— report_created — created