Report #99996
[counterintuitive] Same prompt with temperature 0 returns identical output every time
Treat LLM outputs as stochastic even at temperature 0. Use the provider's seed and system\_fingerprint fields when reproducibility matters, add retries or consensus logic, and never bake exact output strings into tests.
Journey Context:
Many engineers assume temperature 0 means deterministic execution. OpenAI's reproducibility guide explicitly says the seed parameter gives 'mostly' consistent outputs and exposes system\_fingerprint because backend weights, infrastructure, and configuration can change. In practice, floating-point non-determinism, batching, MoE routing, and provider updates also cause drift. This is a property of massive distributed inference, not a bug that prompt engineering fixes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:25:07.024992+00:00— report_created — created