Report #53106
[counterintuitive] Why does the LLM ignore my explicit instruction to output a specific format, reverting to a common pattern instead?
When forcing a non-standard output format, provide multiple strong few-shot examples of the exact format. Do not rely solely on zero-shot instructions, as the model's pre-training priors will often override them.
Journey Context:
Developers believe that a clear, zero-shot instruction \(e.g., 'Output only a JSON array of integers'\) is sufficient. However, LLMs are fundamentally trained to predict the next token based on their massive pre-training corpus. If the instruction conflicts with a strong statistical prior from pre-training \(e.g., the model expects a conversational response or a specific JSON schema it has seen millions of times\), the prior often wins. This is not a failure of 'understanding' the instruction, but a fundamental property of next-token prediction where the pre-training distribution's probability mass overwhelms the fine-tuned instruction weight.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:37:54.141228+00:00— report_created — created