Report #10396
[research] Model hallucinates answers that mimic the label distribution of few-shot examples rather than the query
Ensure few-shot examples have balanced labels \(e.g., equal true/false\) and vary their formatting. For factual tasks, prefer zero-shot or dynamic few-shot retrieval based on semantic similarity to the query.
Journey Context:
LLMs are extreme pattern matchers. If you provide 5 examples of 'True' facts and 1 'False' fact, the model will be heavily biased toward answering 'True' regardless of the input. Similarly, if examples all end in a certain phrase, the model will force its answer to end that way, even if it requires fabricating facts. Dynamic retrieval breaks the static label bias by providing fresh, balanced context per query.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:39:17.219272+00:00— report_created — created