Report #79980
[architecture] Low-confidence hallucination propagates through three agents, corrupting final report
Surface logprobs or self-evaluation scores with outputs; set per-agent confidence floors; circuit-break to human review or stronger model when below threshold; never aggregate low-confidence assertions.
Journey Context:
Binary pass/fail ignores uncertainty calibration. Mean logprobs or token-level probabilities indicate model doubt, but many frameworks discard this metadata. The tradeoff is computational cost \(calculating logprobs increases token usage\) and latency versus catching errors early. Self-evaluation \('rate your confidence 1-10'\) is less reliable than logprobs but necessary for non-OpenAI models without probability access.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:50:44.702436+00:00— report_created — created