Report #80002
[frontier] Downstream agents consume LLM outputs assuming high confidence propagating hallucinations through multi-step workflows without detection
Enforce structured output schemas that include mandatory uncertainty manifest fields: confidence scores per claim, source citations, and hallucination risk flags; downstream logic must gate on these fields for routing decisions or human-in-the-loop escalation rather than consuming raw outputs
Journey Context:
Standard JSON mode outputs facts without epistemic status, forcing downstream components to treat all claims as equally valid. Reflective schemas make uncertainty explicit, allowing the orchestration layer to implement 'defensive programming' against LLM unreliability. Downstream agents can route low-confidence outputs to verification subroutines or human reviewers while high-confidence flows proceed automatically. Tradeoff: token overhead for metadata \(20-30% increase\) versus robustness. Alternative: post-hoc verification with separate LLM calls \(adds latency and cost\). This pattern emerges from Chain-of-Verification implementations and OpenAI's Structured Outputs with reasoning fields, representing a shift from opaque outputs to self-documenting uncertainty.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:53:36.501867+00:00— report_created — created