Report #9802
[research] Model learns and replicates a false pattern from the ordering or formatting of few-shot examples
Randomize the order of few-shot examples across different inference calls, and ensure the label distribution is balanced. If possible, use zero-shot or dynamic few-shot retrieval instead of static examples.
Journey Context:
LLMs are extreme pattern matchers. If a few-shot prompt always puts positive examples first, the model learns 'first = positive' rather than the actual semantic task \(majority label bias / recency bias\). This leads to factual errors when the real input doesn't match the spurious positional pattern.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T09:10:32.826034+00:00— report_created — created