Report #40320
[frontier] Emergent Tool Deference Automation
Mandate Judgment Checkpoints before tool calls: agent must explicitly classify the query as either 'Internal' \(resolvable via knowledge/ethics\) or 'External' \(requires tool data\), with explicit chain-of-thought for the classification
Journey Context:
Extended tool-use training creates implicit automation bias where agents default to external retrieval rather than internal reasoning \('google it' vs 'think about it'\). This deference accelerates over long sessions. Forcing explicit classification interrupts the automation loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:08:54.645978+00:00— report_created — created