Report #50699
[synthesis] Hallucination death spiral during AI onboarding
Inject high-confidence, highly constrained system prompts during onboarding to prevent early hallucinations, even if it limits AI capability, because early trust loss triggers adversarial user prompting that worsens model performance.
Journey Context:
If an AI hallucinates early, users lose trust and shift to highly specific, low-context 'guardrail' prompts \(e.g., 'only answer from this text, do not use outside knowledge'\). Ironically, overly constrained prompts often strip away the context window needed for the model to ground itself, increasing the likelihood of hallucinations or refusals. This creates a death spiral: bad output -> defensive prompting -> worse output. Standard software onboarding doesn't have this because features don't change behavior based on user suspicion. You must sacrifice early feature breadth to guarantee early feature accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:34:49.629251+00:00— report_created — created