Report #84015
[synthesis] Response structure differences under identical formatting prompts
Use XML tags for structuring Claude prompts and outputs. Use JSON Schema for GPT-4o. Use Markdown or JSON for Gemini. Avoid cross-format instructions (e.g., do not ask Claude for raw JSON without explicit escaping, do not ask GPT-4o for XML if JSON is available).
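A minimal sketch of this routing, assuming the model family can be read from the start of the model name. The names `FORMAT_BY_FAMILY`, `family_of`, and `format_instruction`, and the wording of each instruction, are hypothetical and not taken from any vendor SDK:

```python
# Hypothetical mapping from model family to the output format recommended above.
FORMAT_BY_FAMILY = {
    "claude": "xml",
    "gpt-4o": "json_schema",
    "gemini": "json",
}


def family_of(model_name: str) -> str:
    """Return the family whose key is a prefix of the model name (an assumption)."""
    name = model_name.lower()
    for family in FORMAT_BY_FAMILY:
        if name.startswith(family):
            return family
    raise ValueError(f"unknown model family: {model_name}")


def format_instruction(model_name: str, field: str = "answer") -> str:
    """Build the formatting sentence to append to a prompt for this model."""
    fmt = FORMAT_BY_FAMILY[family_of(model_name)]
    if fmt == "xml":
        return f"Put the result inside <{field}></{field}> tags."
    if fmt == "json_schema":
        return f'Return a JSON object that matches the supplied schema, with a "{field}" key.'
    return f'Return a JSON object with a single "{field}" key.'
```

Keeping the mapping in one table means a format change for one family touches a single line, and an unrecognized model fails loudly instead of silently receiving a cross-format instruction.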
Journey Context:
Models are trained on different data distributions, so each handles some output formats more reliably than others. Claude understands and emits XML tags with high fidelity; asking it for raw JSON often yields JSON wrapped in a Markdown code fence or surrounded by conversational filler. GPT-4o is heavily optimized for JSON through Structured Outputs and struggles with complex XML escaping. Gemini performs best with Markdown or JSON and often ignores XML structural constraints. Forcing a model into a non-native format increases token cost and error rates, while aligning the requested format with the model's native training distribution improves reliability.
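When a cross-format request cannot be avoided, the failure modes described above (Markdown-wrapped JSON, conversational filler) can be tolerated on the parsing side. The following is a sketch using only the Python standard library; `extract_json` and `extract_tag` are hypothetical helper names, and the regular expressions assume a single, non-nested payload per reply:

```python
import json
import re

# Build the Markdown fence marker programmatically to keep this snippet easy to embed.
FENCE = "`" * 3
FENCE_RE = re.compile(FENCE + r"(?:json)?\s*(.*?)" + FENCE, re.DOTALL)


def extract_json(text: str):
    """Parse a JSON object from a reply that may add a code fence or filler text."""
    match = FENCE_RE.search(text)
    candidate = match.group(1) if match else text
    # Fall back to the outermost braces to skip any leading or trailing prose.
    start, end = candidate.find("{"), candidate.rfind("}")
    if start == -1 or end < start:
        raise ValueError("no JSON object found in reply")
    return json.loads(candidate[start:end + 1])


def extract_tag(text: str, tag: str) -> str:
    """Return the stripped content of the first <tag>...</tag> pair in a reply."""
    pattern = rf"<{re.escape(tag)}>(.*?)</{re.escape(tag)}>"
    match = re.search(pattern, text, re.DOTALL)
    if match is None:
        raise ValueError(f"no <{tag}> element found in reply")
    return match.group(1).strip()
```

This only recovers the payload; it does not validate it against a schema, so a well-formed but wrong object still passes through.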
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:36:40.469968+00:00— report_created — created