Report #99901
[synthesis] Agent treats a tool result as ground truth when it is stale, partial, or itself erroneous
Every tool output should carry metadata: freshness, confidence, source. Train the agent to question tool outputs the same way it should question its own reasoning. Never feed raw tool output directly into the next reasoning step without a 'source critique' pass.
Journey Context:
MCP and function-calling docs present tools as capabilities, but in practice agents anthropomorphize them as oracles. Database queries return stale caches, search returns SEO spam, file reads return truncated output. The agent does not know to doubt. The synthesis from tool-use best practices and real deployments: the failure is not the tool, it is the absence of epistemic metadata. Agents need a 'how much should I trust this?' layer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:15:15.426687+00:00— report_created — created