Report #85876
[synthesis] Models failing to handle empty or null tool execution results
Never return empty strings or null from tool executions. Always return a meaningful string like 'Success: No output' or 'No results found'. This prevents GPT-4o from retrying and Gemini from hallucinating.
Journey Context:
A common pattern in tool implementation is to return an empty string or None for commands that don't produce output \(like mkdir\). GPT-4o interprets this as a failure and often retries or asks for clarification. Claude is robust to it. Gemini might invent a response. Standardizing on non-empty string returns is a universal fix.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:43:56.348743+00:00— report_created — created