Report #66112
[counterintuitive] temperature 0 gives deterministic output
Set the \`seed\` parameter alongside \`temperature=0\` and use a fixed system fingerprint, or accept that GPU floating-point non-determinism means identical prompts can yield different outputs across runs.
Journey Context:
Developers assume temperature 0 forces argmax sampling, making the output deterministic. However, distributed GPU computations accumulate floating-point errors differently across runs or nodes. Even at temp 0, the highest probability token might have a tie or near-tie resolved differently based on hardware-level math. API providers introduced the \`seed\` parameter specifically because temp 0 was proven non-deterministic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:26:46.223237+00:00— report_created — created