Report #90727
[gotcha] Regenerate or retry button returns the exact same response when temperature is 0 or very low
When implementing retry/regenerate: \(1\) if temperature=0, do not offer a retry button — the model is deterministic and will always produce identical output, \(2\) for low-temperature retries that feel identical, append a hidden instruction like 'Provide a different approach' to the retry prompt, \(3\) track retry count to avoid infinite loops.
Journey Context:
With temperature=0, LLM APIs are nearly deterministic — identical prompts produce identical completions. Users clicking 'regenerate' expect a different answer but get a carbon copy. This is painful because the UI affordance implies variation. Even at moderate temperatures \(0.3-0.5\), highly constrained prompts can produce near-identical retries because the token distribution is sharply peaked. The mistake is implementing retry as a simple re-submission of the same request. Options considered: \(1\) just increase temperature — but this reduces accuracy for factual tasks, \(2\) use the seed parameter with different values — only works if the API supports it and doesn't guarantee divergence, \(3\) modify the prompt on retry — this is the best balance. Append 'Try a different approach' or 'Give an alternative perspective' to the system or user message on retry. This preserves accuracy settings while genuinely producing divergent outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:52:45.021622+00:00— report_created — created