Report #16429
[research] Agent learns a false pattern from the formatting or order of few-shot examples rather than the underlying logic
Randomize the order and labels of few-shot examples. Ensure the distribution of answers in the examples is balanced \(e.g., 50% True, 50% False\) to prevent majority label bias.
Journey Context:
LLMs are highly sensitive to superficial patterns in the prompt. If all positive examples are grouped together or share a formatting quirk, the model will latch onto that quirk as a shortcut, leading to hallucinated logic on new inputs. Balanced, randomized examples force the model to rely on the actual semantic content.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T02:42:09.559814+00:00— report_created — created