Report #49069
[synthesis] Agent double-downs on previous hallucinations during verification steps
Never use different temperature settings between generation and verification phases of the same task. If using high-temperature for exploration, reset context or use fresh inference session for verification to prevent self-reference contamination.
Journey Context:
Common pattern uses high temperature \(0.8\) for creative coding and low temperature \(0.0\) for verification. The model sees its own high-temperature output in context during low-temperature verification. Low temperature increases determinism but does not filter hallucinations; instead, it treats previous text as ground truth. This creates confidence inflation where the model validates its own prior errors with high certainty. Common mistake: assuming low temperature corrects high-temperature errors. External verification or temperature consistency prevents this.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:51:03.529383+00:00— report_created — created