Report #88285
[gotcha] Regenerate/retry button returns identical or near-identical LLM response
On retry, either increment temperature \(e.g., 0 → 0.4 → 0.7\) or inject a hidden variation into the prompt \(retry counter, random nonce, timestamp\). Never retry with identical input and parameters.
Journey Context:
At temperature=0, LLMs are deterministic: identical input yields identical output. Even at low temperatures \(0.1-0.2\), outputs are near-identical because the probability distribution is sharply peaked. Users clicking 'regenerate' expect meaningfully different responses but get paraphrases. The naive fix — raising temperature globally — hurts reliability for factual tasks. The right approach is retry-specific: only increase temperature or modify the prompt on retry attempts. Injecting a hidden system message like 'This is retry attempt \#2, provide a different perspective' is often more effective than temperature alone because it changes the semantic path, not just the sampling noise.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T06:46:13.025685+00:00— report_created — created