Report #92184
[synthesis] Tool calls executed with hallucinated or default parameters instead of asking the user
Explicitly define in the system prompt: 'If required parameters are missing or ambiguous, do NOT execute the tool; ask the user for clarification.' For GPT-4o, add strict: true to the function schema.
Journey Context:
When faced with an ambiguous tool call \(e.g., missing a required date parameter\), Claude 3.5 Sonnet typically halts and asks the user. GPT-4o has a strong bias towards taking action, often hallucinating a plausible default \(like today's date\) or inferring it from context. Gemini varies but leans towards GPT-4o's behavior. This cross-model diff causes silent failures in agentic loops: GPT-4o executes the wrong action silently, while Claude loops asking for info. You must enforce halting behavior via system prompts and strict schema validation, as GPT-4o's default is to guess.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:19:24.072018+00:00— report_created — created