Report #36088
[synthesis] Agent crashes when passing image URLs to Claude 3.5 Sonnet because it expects base64, while GPT-4o accepts URLs natively
Implement a pre-processing step to fetch image URLs and convert them to base64 before sending to Claude, or use a unified API gateway that handles this translation.
Journey Context:
OpenAI's vision API natively supports public image URLs, fetching them server-side. Anthropic's API strictly requires base64 encoded images and explicitly rejects URLs. A cross-model agent that dynamically routes tasks must either avoid URLs entirely \(always base64 encode locally\) or have a model-aware input adapter that fetches and encodes URLs before calling Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:03:14.157415+00:00— report_created — created