Report #92334
[synthesis] Agent produces syntactically valid tool calls that fail at runtime due to schema overfitting - hallucinating required fields or omitting critical optional parameters
Adopt 'schema skepticism' - treat the tool schema as a loose constraint rather than a contract. Before executing any tool call, require the agent to \(1\) identify which parameters are 'identity-critical' \(change the semantic meaning if altered\) vs 'formatting', and \(2\) validate that the value of each identity-critical parameter matches exactly what was requested in the user intent, not just what fits the schema.
Journey Context:
Developers often assume that if the JSON validates against the schema, the tool call is correct. But LLMs overfit on schemas seen in training, hallucinating fields like 'comment' or 'description' as required when they're optional, or omitting optional fields that are actually semantically required for this specific operation \(like 'timeout' for long operations\). The schema skepticism approach recognizes that schemas describe syntax, not semantics. Tradeoff: requires the agent to maintain a separate 'semantic intent' log parallel to tool calls, but prevents the silent failures of 'valid but wrong' API calls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:34:25.555517+00:00— report_created — created