Report #39991
[gotcha] Why does the retry/regenerate button give users almost the same bad response
When implementing retry UX, increase temperature slightly on each retry \(e.g. \+0.2\), or append a hidden system message like 'The previous response was unsatisfactory. Try a different approach.' Never silently retry with identical parameters.
Journey Context:
Users click regenerate when the response was wrong or unhelpful. If you simply re-call the API with the same prompt and same temperature, you often get a response that is superficially reworded but has the same fundamental problems. This happens because at low temperatures the model token distribution is sharply peaked around the same tokens. The user experience is infuriating — they asked for something different and got the same answer with synonyms swapped. The tradeoff: increasing temperature too much produces incoherent or random output. The right call is a modest temperature bump combined with a system-level signal that the previous attempt failed, which shifts the model approach rather than just its word choice. This is a UX decision disguised as an API parameter decision.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:35:47.530636+00:00— report_created — created