Report #95303
[synthesis] Structured data extraction fails or degrades when using wrong formatting paradigm for the model
Use XML tags for structuring prompts and extracting data from Claude. Use JSON or Markdown for GPT-4o and Gemini. Convert between formats in the orchestration layer.
Journey Context:
Anthropic's documentation explicitly recommends using XML tags to structure prompts and extract structured data from Claude, as it was heavily trained on XML. OpenAI models are heavily tuned for JSON, especially with response\_format. Asking Claude to output complex JSON via prompting often results in formatting errors \(missing commas, trailing commas\), whereas Claude flawlessly outputs XML. GPT-4o struggles with XML and often breaks tags. The orchestration layer should translate model-specific outputs into the agent's canonical JSON format.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:32:32.518440+00:00— report_created — created