Report #82937
[research] LLM learns and replicates a false pattern from the few-shot examples provided in the prompt
Randomize the ordering of few-shot examples across multiple inference calls. Ensure few-shot examples do not share superficial formatting traits that the model might latch onto instead of the underlying task logic.
Journey Context:
LLMs are highly sensitive to prompt formatting. If all positive examples in a few-shot prompt end with a period, and negative examples end with a question mark, the LLM will learn punctuation as the classification heuristic rather than the semantic content. This majority label bias or recency bias causes the model to hallucinate based on superficial prompt artifacts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:48:17.038213+00:00— report_created — created