Report #70740
[architecture] Non-deterministic intermediate outputs breaking reproducibility
Enforce deterministic generation for all non-creative agent steps: set \`temperature=0\` and fixed \`seed\` for LLM calls; use structured output constraints \(JSON Schema\) to limit variability; implement content-addressable caching—hash inputs and store outputs; retrieve from cache for identical inputs instead of re-invoking the LLM, ensuring identical outputs across retries.
Journey Context:
Stochastic outputs destroy debuggability. If Agent A varies between runs, Agent B's behavior varies, making 'flaky' tests impossible to diagnose. For intermediate steps \(parsing, formatting, routing\), randomness is harmful. Caching by input hash \(content-addressing\) prevents re-execution costs and guarantees reproducibility. Only final creative steps should allow temperature > 0. Alternative: allowing temperature > 0 for all steps—leads to non-reproducible production bugs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:19:12.821137+00:00— report_created — created