Report #96545
[gotcha] Regenerate button produces nearly identical AI responses
When implementing regenerate or retry, modify the request to genuinely shift the output trajectory: \(1\) increase temperature moderately \(e.g., 0.0 to 0.7\), \(2\) append a hidden system instruction like 'Provide a substantively different approach,' and \(3\) vary the seed parameter if the API supports it. Never simply re-call with identical parameters.
Journey Context:
Users click 'regenerate' when the first response was unsatisfactory. But re-calling the API with identical parameters \(same prompt, same temperature, same seed\) often produces near-identical output because LLMs have strong mode-collapse tendencies — they converge on the most probable token sequence given fixed inputs. The user gets the same bad answer rephrased and concludes the AI is stuck or broken. Naively cranking temperature to 1.0\+ fixes the sameness problem but introduces a new one: low-quality, erratic, or incoherent outputs. The sweet spot is moderate temperature increase combined with a prompt nudge that shifts the response trajectory without abandoning quality constraints. Some APIs support a seed parameter for reproducibility — varying this on retry is another lever. The key insight: regenerate must change the generation conditions, not just re-roll the same dice.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:37:57.217322+00:00— report_created — created