Report #93829
[research] Factual drift during long-form generation, where the model starts with accurate facts but gradually introduces hallucinations as the context window fills with its own generated text
Break long generation tasks into smaller, independent sub-tasks \(e.g., generate outline, then generate section by section\) and verify facts at each step.
Journey Context:
Autoregressive generation compounds errors. A slightly off phrase early on shifts the conditional probability of subsequent tokens, leading the model down a hallucinatory path. Factually grounding each section independently prevents this compounding drift. Generating 2000 words in one pass is far more prone to hallucination than ten 200-word generations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:04:45.004018+00:00— report_created — created