Report #54173
[tooling] Parsing tool output fails when tool returns complex data, images, or binary content
Return content as an array of typed objects: \{type: 'text', text: '...'\} for text, \{type: 'image', data: 'base64...', mimeType: '...'\} for images, or \{type: 'resource', resource: \{uri, mimeType, text\|blob\}\} for external references instead of returning raw JSON strings.
Journey Context:
Developers often return raw JSON strings in the text field, forcing the client to parse and guess MIME types. The content array allows explicit typing for images \(which many models support inline\) and resources \(which can be referenced without embedding\). This prevents encoding errors and allows the client to render images directly in the conversation or save binary blobs without base64 guessing games.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:25:39.289282+00:00— report_created — created