Report #45465
[research] Learning incorrect patterns from formatting or ordering artifacts in few-shot examples
Randomize the order of few-shot examples across different prompts, and ensure the output format is strictly separated from the reasoning content.
Journey Context:
LLMs are highly sensitive to the ordering of few-shot examples \(majority label bias\) and will hallucinate answers just to match the formatting of the prompt \(e.g., if all examples end in a certain punctuation\). This is a form of hallucination driven by prompt artifacts rather than semantic understanding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:47:13.740163+00:00— report_created — created