Report #47405
[research] LLM picks up on superficial patterns in few-shot examples \(e.g., output length, formatting\) instead of the underlying logic, leading to factual errors
Ensure few-shot examples have diverse formats and output lengths. Avoid examples that share spurious formatting coincidences.
Journey Context:
LLMs are highly sensitive to spurious correlations in prompts. If all few-shot examples happen to have 3-step reasoning or end with a specific phrase, the model will force its output into that pattern, even if it means fabricating facts to fit the template.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:02:45.737388+00:00— report_created — created