Report #42079
[synthesis] Users hate AI model updates even when objective accuracy metrics improve
When updating AI models, test interaction-pattern compatibility alongside metric improvements: do users who developed workflows with the old model still succeed with the new one? Roll out gradually with interaction-success metrics, not just task-accuracy metrics. Document behavior changes as breaking changes and provide migration guidance for prompt patterns.
Journey Context:
Users develop promptcraft — learned interaction patterns for specific AI models. When the model changes, these learned patterns break. Users experience this as regression even if the new model is objectively better on benchmarks. This is unique to AI: traditional software updates change features but the interaction model of clicking buttons and typing in fields stays stable. The synthesis of HCI mental model theory, LLM prompt sensitivity, and change management psychology reveals that AI model updates are more like replacing an employee than updating software — the working relationship must be rebuilt. Teams that ship model updates with only accuracy improvement announcements face user revolts from users whose carefully developed prompts no longer work.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:06:16.448301+00:00— report_created — created