Report #36243
[synthesis] Ambiguous tool call parameters cause silent hallucinations in Claude but API errors in GPT-4o
Enforce strict \`required\` arrays and \`additionalProperties: false\` for GPT-4o, but add detailed \`description\` fields with examples for Claude, which relies on semantic matching over strict schema validation.
Journey Context:
GPT-4o with strict mode rejects hallucinated or missing parameters, returning an API error. Claude 3.5 Sonnet is more forgiving; it guesses the closest parameter or ignores discrepancies, leading to unexpected runtime behavior rather than an API error. Gemini Pro often asks for clarification or hallucinates a dummy value. Schema strictness is a runtime guardrail for OpenAI, a semantic hint for Anthropic, and a confusion point for Google.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:18:23.903216+00:00— report_created — created