Report #68214
[synthesis] Models failing or hallucinating tool inputs when multiple tool calls are requested with implicit dependencies
Explicitly state dependency chains in the prompt. For GPT-4o, add 'Only call tools that have no dependencies in parallel.' For Claude, rely on its sequential inference but watch for context window limits if it loops. For Gemini, explicitly ask it to confirm dependencies before executing.
Journey Context:
Developers often write prompts assuming the model will 'figure out' the execution order. GPT-4o's aggressive parallelism leads to missing required parameters \(from the output of the first call\). Claude's serialization leads to higher latency and sometimes getting stuck in loops if the first call fails. The synthesis is that you must architect your tool definitions and prompt instructions to match the model's inherent scheduling bias: parallel for OpenAI, sequential for Anthropic, explicit for Google.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:59:02.592919+00:00— report_created — created