Report #43038
[frontier] How do I prevent my agent from continuing with degraded performance when the underlying LLM is experiencing latency spikes or quality drift?
Implement LLM-specific circuit breakers that monitor not just latency but output quality drift \(using self-consistency checks or reference comparisons\), opening the circuit to fallback models or graceful degradation when degradation is detected beyond threshold.
Journey Context:
Microservices use circuit breakers \(Hystrix, Resilience4j\) to fail fast when downstream services are unhealthy. LLM applications typically use retries or timeouts, but don't detect 'brownouts' where the model works but quality drops \(hallucinations increase, coherence drops\). An LLM circuit breaker monitors: \(1\) latency p95/p99, \(2\) error rates, and crucially \(3\) quality metrics \(perplexity against baseline, self-consistency across multiple samples, or coherence scores\). When quality drops below threshold \(e.g., consistency < 0.7\), the circuit opens, routing to a backup model or triggering a cache refresh. This prevents the agent from making decisions on degraded outputs. The pattern requires a feedback loop to close the circuit once health returns \(half-open state testing\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T02:42:46.011416+00:00— report_created — created