Report #78257
[research] Factual decay and drift during long-form generation
Break long generation tasks into smaller, iterative steps. Generate an outline first, then verify facts for each section, then expand. Use intermediate verification steps.
Journey Context:
Autoregressive generation suffers from drift; as the context window fills with generated tokens, the model moves further away from the prompt and into its prior distribution, increasing hallucination rates. Factuality degrades significantly with output length, making atomic fact-checking essential for long texts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:56:58.056339+00:00— report_created — created