Report #14669
[research] Agent hallucinates a pattern based on few-shot examples that don't actually represent the underlying rule, leading to confident but incorrect outputs
Ensure few-shot examples are highly diverse and representative of the true distribution, and prefer zero-shot with explicit step-by-step reasoning over few-shot if the examples might introduce spurious shortcuts.
Journey Context:
LLMs are eager pattern matchers. If you provide 3 examples where the answer always starts with 'Yes', the model will learn 'answer starts with Yes' rather than the logical rule. Agents often over-rely on few-shot prompts for factuality, but few-shot can actually degrade it if the examples contain superficial correlations. Zero-shot Chain of Thought often outperforms few-shot for factual accuracy because it forces the model to derive the answer from its weights, not the prompt's superficial patterns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T22:12:33.082493+00:00— report_created — created