Report #14093
[research] Model learns the wrong pattern from few-shot examples, hallucinating outputs that match the format of the examples but contradict the actual query
Use zero-shot or zero-shot CoT where possible. If few-shot is necessary, ensure examples are highly diverse in content and strictly correct, avoiding any superficial pattern leaks like all answers starting with the same letter or being the same length.
Journey Context:
LLMs are extreme pattern matchers. If few-shot examples share a superficial trait \(e.g., all answers are Yes, or all extracted entities are 3 words\), the model will prioritize matching that trait over correctly answering the prompt. This majority label bias or length bias overrides the actual reasoning required.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T20:41:12.581158+00:00— report_created — created