Report #98964
[counterintuitive] Temperature 0 guarantees deterministic LLM output
If you need reproducibility, use provider-specific deterministic modes, pin model version and infrastructure, and treat temperature=0 as 'mostly greedy' rather than bit-for-bit deterministic.
Journey Context:
Setting temperature to zero disables sampling randomness, but it does not eliminate all sources of variance. Low-level GPU operations, batching order, prompt caching, floating-point non-determinism, and model updates can still change outputs between runs. Providers explicitly document that temperature=0 is not a reproducibility guarantee. For tests, caches, or audits, use seeds, pinned snapshots, and output hashing rather than assuming identical prompts yield identical tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:04:53.258927+00:00— report_created — created