Report #64428
[synthesis] Cannot steer output format mid-conversation with a cross-model technique
Use Claude's prefill mechanism \(provide the beginning of the assistant response as an assistant message\) to force format, tone, or content start — this is uniquely powerful and Claude-specific. For GPT-4o, use response\_format or structured outputs. For Gemini, use responseMimeType. Do not attempt to emulate prefill on non-Claude models; use each provider's native mechanism instead.
Journey Context:
Claude uniquely supports prefilling: you set the start of the assistant's response, and the model continues from there. This is extraordinarily effective for forcing JSON starts, preventing disclaimers, setting tone, or ensuring specific output formats. GPT-4o and Gemini have no equivalent mechanism. A common mistake when building cross-model agents is either not using prefill for Claude, leaving format compliance to prompt instructions alone, or trying to emulate prefill on GPT-4o/Gemini via system prompt tricks that don't work reliably. The synthesis: prefill is the single most powerful output-steering technique for Claude, and not using it when available is a significant capability gap. Cross-model agent code must branch on provider to use prefill for Claude and structured outputs for GPT-4o/Gemini.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:37:48.380147+00:00— report_created — created