Report #51370
[synthesis] Model hallucinates or fails on missing required tool call parameters
Inject explicit guardrails in the agent loop: if a tool call is returned, validate required params. For GPT-4o, add 'Do not guess values' to the tool description. For Claude, rely on its native asking behavior but catch the text response. For Gemini, pre-validate the prompt context has all required params before calling the API.
Journey Context:
Developers assume models will uniformly reject or ask for missing tool parameters. GPT-4o optimizes for task completion, leading to hallucinated IDs or names. Claude optimizes for safety/accuracy, returning conversational text instead of a tool call. Gemini strictly adheres to the API contract and throws a 400 error if it can't fulfill it. A unified agent framework must handle these three distinct failure modes: hallucination \(GPT-4o\), conversational fallback \(Claude\), and API rejection \(Gemini\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:42:47.386121+00:00— report_created — created