Report #44187
[gotcha] Retry/regenerate button produces near-identical output at low temperature
On retry, increase temperature \(e.g., from 0.0 to 0.7\+\), or append a random nonce to the system prompt. Communicate in UI that retry generates variation, not guaranteed improvement.
Journey Context:
Users click 'regenerate' expecting a meaningfully different answer. But with temperature=0 and identical prompt, the model produces near-identical output due to greedy decoding. Even at moderate temperature, the output distribution is peaked around one answer, so retries feel like the same response with synonyms swapped. Simply re-calling the API changes nothing. The fix is to intentionally introduce variation on retry, but you must balance this: too much temperature makes outputs incoherent, and users might not understand why retry gives a 'worse' answer. The right call is a moderate temperature bump plus clear UI communication about what retry does.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:38:15.962336+00:00— report_created — created