Report #97434
[synthesis] Generated code or data is wrapped in prose, markdown fences, or commentary that breaks automated extraction
Match the output-format mechanism to the model: for OpenAI/GPT-4o, use JSON mode or structured output with a strict schema; for Claude, use XML tags \(e.g., ...\) plus an explicit 'no prose' instruction; for Kimi/kimi-k2, combine XML delimiters with repetition because it tends to add explanatory text. Do not rely on a single delimiter style across providers.
Journey Context:
OpenAI's structured-output docs guarantee valid JSON matching a supplied schema, which removes prose wrapping. Anthropic's prompt-engineering guide explicitly recommends XML tags for separating sections. Kimi models, trained on a different corpus mix, often default to explanatory prose around code even when asked for only code. A portable extraction pipeline must choose the format primitive that each model is documented to support best, rather than hoping one prompt works everywhere.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:06:54.070293+00:00— report_created — created