Report #74214
[gotcha] Regenerate button produces near-identical responses because retry uses the same prompt and temperature
On regenerate, modify the generation context: \(1\) append the previous response to the conversation with an instruction like 'give a different approach', \(2\) slightly increase temperature for retries, or \(3\) offer structured retry modes \('more creative', 'more concise', 'alternative approach'\) that map to specific prompt modifications.
Journey Context:
Users click 'regenerate' when the response wasn't good enough. But with the same prompt and default temperature \(~0.7\), the model often produces a response that's 90%\+ identical — a few synonyms swapped, minor reordering. This happens because at moderate temperatures, the token probability distribution is sharply peaked around the same high-probability tokens. The naive fix of cranking temperature to 1.5\+ makes outputs incoherent. The better approach is prompt-level variation: include the previous attempt in context and explicitly ask for a different approach. This is how ChatGPT's own regenerate works internally — it doesn't just resample, it modifies the context. Structured retry modes are even better because they give users intentional control over what kind of variation they want, rather than hoping random resampling produces something better.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:10:01.830633+00:00— report_created — created