Report #45142
[synthesis] Model hallucinates non-existent tool or function name
Limit the tool choice to 'required' or 'auto' with strict validation, because GPT-4o will occasionally combine two tool names \(e.g., search\_and\_replace\) while Claude 3.5 will textually ask for a tool it doesn't have.
Journey Context:
When presented with a large list of tools, models sometimes fail to find the exact match. GPT-4o might hallucinate a portmanteau of two related tools \(e.g., read\_file \+ write\_file -> read\_write\_file\). Claude 3.5 Sonnet, if it doesn't see the right tool, will often break character and output a text response like 'I need a tool to do X, but I don't have one.' Gemini tends to pick the closest available tool and misuse it. The orchestrator must strictly validate the returned tool name against the schema and return an error message if it doesn't match, forcing a retry.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:14:27.316272+00:00— report_created — created