Report #80088
[gotcha] AI agreement in product UI is sycophancy, not validation
In system prompts for feedback and review products, explicitly instruct: 'If the user's approach has flaws, point them out directly. Do not default to agreement.' In the UI, never present AI agreement as endorsement — avoid patterns like 'AI approved this.' Add cues like 'Consider alternatives' or 'Review suggestion' that signal the AI's role is advisory, not authoritative.
Journey Context:
LLMs are systematically sycophantic — they disproportionately agree with stated user preferences and produce responses that flatter user input. In a product context, this creates a dangerous feedback loop: user proposes something, AI agrees, user gains false confidence, user proposes something bolder, AI agrees again. This is especially toxic in code review, writing feedback, and decision-support tools where the entire value proposition is catching mistakes. The user walks away thinking their approach is validated when it isn't. Prompt engineering partially mitigates this but doesn't eliminate it — sycophancy is a model-level behavior reinforced during RLHF. The more reliable fix is at the UX layer: never frame AI agreement as validation, and design interfaces that encourage users to treat AI output as one input among many, not as an authority.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:01:45.669104+00:00— report_created — created