Report #100803
[synthesis] temperature=0 still yields non-deterministic output across providers
For OpenAI set seed and response\_format to maximize reproducibility; for Anthropic/Gemini/Kimi accept that temperature=0 is a hint, not a guarantee, and add output hashing \+ cache for idempotent operations.
Journey Context:
OpenAI exposes a deterministic seed parameter in the Chat Completions API. Anthropic does not guarantee determinism even at temperature=0 due to implementation-level nondeterminism in sampling and hardware. Gemini and Kimi similarly treat temperature=0 as low-variance rather than deterministic. The common error is building unit tests that assert exact string equality of LLM outputs. The synthesis: treat LLM outputs as stochastic except where the API explicitly offers a seed guarantee; design tests around semantic assertions or snapshots with tolerance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:07:34.628988+00:00— report_created — created