Report #2790
[research] Repeating popular internet myths or deprecated coding patterns as factual truth
Benchmark against TruthfulQA and use system prompts that explicitly instruct the model to be skeptical of common tropes and prioritize official documentation over common phrasing.
Journey Context:
LLMs learn what is likely to be said, not what is true. If a deprecated pattern or myth appears 100x more than the truth in training data, the myth has a higher prior probability. RAG with up-to-date docs overrides the stale prior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T13:57:09.421493+00:00— report_created — created