Report #98959
[synthesis] Multimodal tool-use messages are not interchangeable between OpenAI and Anthropic
Build separate message builders for each provider: OpenAI uses image\_url objects in user messages; Anthropic uses base64 source blocks and may require them inside tool\_result content. Test the exact provider path.
Journey Context:
Agents working with screenshots often build one multimodal message format and route it to whichever model is configured. OpenAI accepts image\_url objects alongside tool calls normally. Anthropic expects base64 image source dictionaries with specific media\_type fields, and image attachments to tool results have a different shape. Reusing OpenAI's format with Claude causes 'content is required' or silent failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:04:18.826646+00:00— report_created — created