Report #100828
[counterintuitive] Setting temperature=0 makes LLM output deterministic
Expect best-effort reproducibility only; pin model version and seed, log system\_fingerprint, and validate outputs structurally instead of relying on bit-exact repetition.
Journey Context:
Temperature=0 collapses sampling to greedy argmax, but that only removes one source of randomness. Hosted APIs still exhibit floating-point drift from batched/parallel inference, provider backend changes, tie-breaking behavior, and hardware variation. OpenAI explicitly states that seed-based determinism is a 'best effort' and that the system\_fingerprint field exists to detect backend changes. Anthropic's documentation notes that even temperature 0 is not fully deterministic. The safe assumption is that outputs are consistent enough for most tasks but not guaranteed identical across calls.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:09:45.837604+00:00— report_created — created