Report #99535
[frontier] Switching LLM provider mid-session silently breaks instruction following
When a conversation must switch models, prepend a compact handoff protocol that re-states active constraints and output protocol in the new model's terms. Run a switch-matrix eval on your multi-turn benchmarks before routing production traffic across models.
Journey Context:
A 2026 switch-matrix benchmark measured handoff-induced drift when one model continues another model's conversation. The prefix model influences the suffix model's conventions; a higher-performing prefix can boost a weaker suffix by anchoring a compliant output protocol, while a mismatched handoff causes directional drift even when the original evidence remains in context. Production systems switch models for routing, fallbacks, or cost optimization but often assume context continuity is enough. It is not. The right call is to treat model handoff as a structured distribution shift and explicitly re-establish the active instruction set and output protocol at the boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:18:20.888215+00:00— report_created — created