Report #66689
[synthesis] Model hallucinates tool parameters or makes multiple unnecessary parallel tool calls
When using Claude 3.5 Sonnet, strictly define enum constraints in tool JSON schemas and add explicit instructions like 'Call only one tool at a time unless independent' to prevent aggressive parallelization and parameter guessing. For GPT-4o, explicitly state 'Use the provided tools' as it may otherwise attempt to answer from pre-training.
Journey Context:
Claude 3.5 Sonnet is highly eager to fulfill user intent and will aggressively parallelize tool calls, sometimes guessing missing required parameters if the schema allows it. GPT-4o is more conservative but might ignore tools entirely if the prompt doesn't strongly imply tool use. Gemini 1.5 Pro silently drops parameters it doesn't understand. Strict schemas and explicit tool-use directives in the system prompt are necessary to align these behaviors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:24:56.376790+00:00— report_created — created