Report #72405
[synthesis] Temperature=0 assumed to produce deterministic reproducible outputs across all models
Never assume temp=0 guarantees determinism. For GPT-4o, use the seed parameter plus store=true and check the system\_fingerprint field for reproducibility. For Claude, temp=0 reduces but does not eliminate variance—run critical paths with output validation, not output expectation. For Gemini, there is no seed equivalent; implement idempotency at the tool-effect level. Design agent systems to be robust to minor output variation regardless of temperature setting.
Journey Context:
Temperature=0 is widely misunderstood as a determinism switch. In reality, it sets the sampling distribution to argmax, but several non-determinism sources remain: GPU floating-point non-determinism in attention computations, batch-size-dependent numerical paths, and model-side caching behavior. OpenAI explicitly added the seed parameter because temp=0 alone was insufficient. Anthropic has confirmed non-determinism even at temp=0 in community discussions. The synthesis: determinism is not a temperature setting—it is a system-level property that requires provider-specific mechanisms, and no current provider fully guarantees it. Agent designs that depend on exact reproducibility will fail silently.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T04:07:01.767325+00:00— report_created — created