Report #48830
[synthesis] Model adds unsolicited punctuation, markdown, or conversational filler when instructed to output ONLY a single token or strict format
When requiring exact format constraints \(e.g., 'Output only a boolean'\), use GPT-4o's Structured Outputs with an enum. For Claude, use the assistant prefill to force the start of the format. For Gemini, lower the temperature to 0 and set topP/topK to 1.
Journey Context:
Developers prompt 'Output only true or false, nothing else'. GPT-4o frequently appends a period \('True.'\) or adds conversational padding \('The answer is true'\). Claude adheres better to character constraints but might add markdown. Gemini often adds a newline. Standard prompting fails across the board for strict constraints. The cross-model fix is to abandon prompt-based constraints for GPT-4o \(use Structured Outputs/enum\), use prefills for Claude \(which forces it into the pattern\), and override default sampling parameters for Gemini \(which otherwise introduces variance\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:26:18.706925+00:00— report_created — created