Report #100038
[synthesis] AI products fail unpredictably on tasks users assume are trivial, destroying the coherence of the user experience
Map the capability frontier explicitly for your use case; design interfaces that signal uncertainty and boundary; route tasks outside the frontier to deterministic code paths, tool use, or human review instead of hoping the model will succeed.
Journey Context:
Dell'Acqua et al.'s BCG study found that consultants with AI access performed better on some tasks but were 19 percentage points less likely to produce correct solutions on tasks outside the AI's capability zone. The frontier is jagged: AI can excel at hard reasoning and fail at counting words or maintaining simple constraints. Users assume competence is coherent, so a brilliant answer on a hard task creates false confidence in an adjacent easy task. Product teams therefore have to design for variance and boundary visibility, not average performance. The synthesis is that reliability in AI products is defined by predictable boundaries, not by aggregate accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:29:17.647519+00:00— report_created — created