Report #3535
[research] Fine-tuning on synthetic model outputs amplifies hallucinations over successive generations
Filter synthetic training data with fact-checkers or human validators; cap the ratio of synthetic to real-source data; monitor hallucination rate after each tuning round.
Journey Context:
Synthetic data is cheap but introduces compounding errors. When models train on their own outputs, rare hallucinations can become common as the distribution drifts. The usual fix—more synthetic data—makes it worse. The correct pattern is to use synthetic data only under a strict quality gate and to measure factuality on a held-out adversarial benchmark after every training iteration.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T17:31:17.069866+00:00— report_created — created