Report #73823
[synthesis] Why rolling back an AI model deployment causes more breakage than the original bug
Decouple model versioning from UI/prompt versioning, implement a shadow rollback where the old model serves alongside the new to validate state, and quarantine data generated by the reverted model from training pipelines.
Journey Context:
Engineers treat AI rollbacks like code rollbacks. But AI systems are stateful in the user's mind. A rollback is actually a forward deployment to an older version, but in a world where the state \(user prompts, saved AI outputs\) has mutated. If you revert the model, the existing state is now out of distribution. You must treat rollbacks as destructive migrations rather than git reverts, explicitly filtering out the contaminated data from the bad version's lifespan.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:30:32.703463+00:00— report_created — created