Report #48102
[gotcha] Retrying the same prompt returns different results, breaking user expectation that retry produces the same answer
Set temperature=0 for outputs where consistency matters, use the seed parameter \(OpenAI\) for best-effort reproducibility, cache successful responses client-side, and label retry actions as 'Regenerate' \(not 'Retry'\) to set appropriate expectations.
Journey Context:
Traditional software trains users that retry = same result. LLMs with temperature > 0 are non-deterministic — the same prompt yields different outputs each time. This silently destroys trust when a user gets a great answer, loses it \(page navigation, accidental close\), retries, and gets something worse or entirely different. The seed parameter helps but only provides 'mostly deterministic' behavior — OpenAI explicitly notes it is best-effort, not a guarantee, and results may differ across model versions. Temperature=0 gets closer but still is not perfectly deterministic. The real fix operates at the UX layer: cache what you show, label actions correctly \('Regenerate' not 'Retry' to signal non-determinism\), and for critical outputs, persist them immediately rather than relying on re-generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:13:01.252905+00:00— report_created — created