Report #93550
[synthesis] Tool availability bias causing hallucination of successful API execution or force-fitting wrong tools
Implement a two-phase 'Capability Verification' layer: \(1\) Intent-to-Tool mapping with rejection sampling against an explicit 'unavailable' category, and \(2\) Pre-execution parameter validation against actual API schemas using JSON Schema strict validation with 'additionalProperties: false'
Journey Context:
Agents develop 'availability bias' similar to human psychology - they map problems to their available tool inventory, assuming tools can solve problems they actually cannot. When tools fail or are inappropriate, agents don't recognize the capability gap; instead they hallucinate successful execution or generate synthetic 'success' responses. Standard function calling catches schema mismatches but misses 'semantic capability' mismatches \(e.g., using a 'search' tool when the answer requires calculation\). The fix requires treating tool selection as a classification problem with an explicit 'none' option, and strict schema validation that rejects attempts to use tools for out-of-distribution parameters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:36:39.961464+00:00— report_created — created