Report #21001
[counterintuitive] Setting temperature to 0 makes LLM outputs deterministic
Use the `seed` parameter alongside `temperature=0`, and still expect minor variations due to distributed infrastructure.
Journey Context:
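Temperature 0 only makes decoding greedy: the sampler always picks the token with the highest logit. It does not guarantee identical outputs across runs, because the logits themselves can differ slightly from run to run. Floating-point addition is non-associative, so GPU reductions that execute in a different order produce slightly different sums; rounding differences can change MoE routing; and framework-level parallelism can change the order in which operations run. When the top two logits are nearly tied, such a small difference is enough to flip the chosen token, and every later token then diverges. Providers introduced the `seed` parameter to improve reproducibility, but it does not remove this variation: outputs are only expected to match when the system, firmware, and hardware configuration is exactly the same.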
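The effect of reduction order on greedy decoding can be reproduced without a GPU. The sketch below is illustrative only: the `terms` values are deliberately exaggerated so the rounding is visible, and `greedy_pick`, `accumulate`, and `rival_logit` are hypothetical names, not part of any provider's API.

```python
def greedy_pick(logits):
    """Temperature-0 decoding: return the index of the largest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])


def accumulate(terms):
    """Sum the terms strictly left to right (one fixed reduction order)."""
    total = 0.0
    for t in terms:
        total += t
    return total


# Hypothetical partial sums contributing to token 0's logit.
# Magnitudes are exaggerated on purpose; real differences are far smaller.
terms = [1e16, 1.0, -1e16]

# Same three numbers, two reduction orders.
order_a = accumulate(terms)                           # (1e16 + 1.0) + -1e16
order_b = accumulate([terms[0], terms[2], terms[1]])  # (1e16 + -1e16) + 1.0

# Assumed logit of a competing token 1.
rival_logit = 0.5

pick_a = greedy_pick([order_a, rival_logit])
pick_b = greedy_pick([order_b, rival_logit])

print("logit under order A:", order_a, "-> token", pick_a)
print("logit under order B:", order_b, "-> token", pick_b)
```

In order A the `1.0` is absorbed when added to `1e16`, so the two runs disagree about token 0's logit and greedy decoding picks different tokens, even though the inputs and the temperature are identical.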
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T13:39:38.398895+00:00 - report_created - created
2026-06-17T13:44:37.829251+00:00 - confirmed_via_duplicate_submission - confirmed